ARTIFICIAL PROTEINS WITH REDUCED IMMUNOGENICITY

TECHNICAL FIELD OF THE INVENTION 

The invention relates to modified artificial proteins, preferably fusion proteins, having a
reduced immunogenicity compared to the parent non-modified molecule when exposed to a
species in vivo. Especially, the invention relates to proteins that are, as a single component,
normally not strongly immunogenic, but which have enhanced immunogenicity when attached
to a second protein moiety to form an, as a rule, artificial fusion protein. The invention relates,
above all, to modified and, thus, novel immunoglobulin (Ig) fusion proteins which essentially
consist of an immunoglobulin molecule or a fragment thereof covalently fused via its
C-terminus to the N-terminus of a biologically active non-immunoglobulin molecule, preferably
a polypeptide or protein or a biologically active fragment thereof. In a specific embodiment,
the invention relates to fusion proteins consisting of an Fc portion of an antibody which is
fused as mentioned to the non-immunological target molecule which elicits biological or
pharmacological efficacy.

The molecules of the invention have amino acid sequences which are altered in one or more
amino acid residue positions but have in principle the same biological activity as compared
with the non-altered molecules. The changes are made in regions of the molecules which are
identified as T-cell epitopes, which contribute to an immune reaction in a living host. Thus,
the invention also relates to a novel method for preparing said fusion proteins by identifying
said epitopes, comprising calculation of T-cell epitope values for MHC Class II molecule
binding sites in a peptide by computer-aided methods.

BACKGROUND OF THE INVENTION 

Therapeutic fusion proteins are, as a rule, artificial molecules, which are produced to combine
known favorable properties of the single components or to create new properties. For example,
a fusion protein may contain an immunogenic moiety that causes a normally non-immunogenic
fusion partner to become immunogenic. In other cases, each of the components
is immunogenic and the fusion molecule has kept this usually undesired property. Finally, it
is possible that, when non- or less immunogenic components are fused, the fusion product is
immunogenic because of the newly created bonds, especially in the junction region.

Fusion proteins of specific interest in this context are immunoconjugates. Immunoconjugates
have been known for several years and many of them have shown pharmacological efficacy in
vitro and in vivo. Immunoconjugates are chimeric molecules consisting, as a rule, of a portion
deriving from an immunoglobulin or a fragment thereof and a target polypeptide or protein
which is linked to the immunoglobulin molecule. Originally, immunoconjugates were
prepared consisting of a complete antibody and a cytotoxic agent like a cytokine which was
fused via its N-terminus to the C-terminus of the constant domain of the immunoglobulin or
alternatively via its C-terminus to the N-terminus of the variable region of the antibody (see,
e.g., EP 0439 095, WO 92/08495, US 5,349,053, EP 0659 439, EP 0706 799). These
chimeric molecules are bi-functional by targeting a specific antigen, for example, on a tumor
cell surface by means of the binding sites within the CDRs of the variable domain of the
antibody portion or a fragment thereof, and by the simultaneous cytotoxic effect of the
cytokine which is coupled to the immunoglobulin and thus can, theoretically, only or
predominantly attack the targeted cell. In this context, also tri- and multi-functional
immunoconjugates were developed including constructs consisting of sFv, Fab, Fab' or
F(ab')2 fragments of different antibodies, wherein in each case the targeting function of the
immunoglobulin portion was advantageously used.

Another immunoglobulin modification is the use of the Fc region of antibodies. Antibodies
comprise two functionally independent parts, a variable domain known as "Fab", "Fab'" or
"F(ab')2", dependent on the kind of digestion of the molecule, which binds antigen, and a
constant domain, known as "Fc", which provides the link to effector functions such as
complement or phagocytic cells. The Fc portion of an immunoglobulin has a long plasma
half-life, whereas the Fab fragments are short-lived (Capon et al., Nature 337: 525-531
(1989)).

Therapeutic protein products have been constructed using the Fc domain to provide longer
half-life or to incorporate functions such as Fc receptor binding, protein A binding,
complement fixation and placental transfer which all reside in the Fc proteins of
immunoglobulins. For example, the Fc region of an IgG1 antibody has been fused to the
N-terminal end of CD30-L, a molecule which binds CD30 receptors expressed on Hodgkin's
Disease tumor cells, anaplastic lymphoma cells, T-cell leukemia cells and other malignant
cell types (US 5,480,981). IL-10, an anti-inflammatory and anti-rejection agent, has been fused
to murine Fcγ2a in order to increase the cytokine's short circulating half-life (Zheng et al.,
The Journal of Immunology, 154: 5590-5600 (1995)). Studies have also evaluated the use of
tumor necrosis factor receptor linked with the Fc protein of human IgG1 to treat patients with
septic shock (Fisher et al., N. Engl. J. Med., 334: 1697-1702 (1996); Van Zee et al.,
The Journal of Immunology, 156: 2221-2230 (1996)). Fc has also been fused with CD4
receptor to produce a therapeutic protein for treatment of AIDS (see: Capon et al., Nature,
337: 525-531 (1989)). In principle, Fc can be fused to the target protein or peptide via its C- or
N-terminus using the N- and C-terminus of the protein, respectively. A chimer of Fc and TNF
and EPO was disclosed in EP 0464 533 (Hoechst / General Hospital), wherein the N-terminus
of Fc was coupled to the C-terminus of the protein (X-Fc). The identical conjunction was
selected for the leptin-Fc chimera disclosed in WO 97/00319 (SKB) and WO 97/24440
(Genentech). There are many publications and patent applications describing the opposite
linkage of Fc-protein chimers (Fc-X), such as Fc-(IL-2), Fc-EPO, Fc-PSMA, Fc-(IL-12),
Fc-TNFα, Fc-(GM-CSF), Fc-TNFR, Fc-endostatin, Fc-angiostatin, Fc-gp120, Fc-leptin, Fc-IFNα,
Fc-(G-CSF). Examples are WO 96/08570 (Fuji / Merck KGaA), WO 98/28427 (Amgen),
WO 99/02709 (Beth Israel Medical Care Center) and WO 99/58662 (Fuji / Merck KGaA).
WO 00/24782 (Amgen) discloses a huge number of possible Fc-X conjugates, wherein the
linkage between the two partners may be Fc-X or X-Fc. An extensive development of Fc-X
molecules was realized by Lexigen / Merck KGaA as disclosed in US 5,541,087, WO
99/43713, WO 99/29732, WO 99/52562, WO 99/53958, WO 00/11033, WO 01/07081,
PCT/EP00/10843. Thus, X-Fc and Fc-X molecules which have "lost" their antigen binding
sites, as well as molecules wherein the binding sites and thus their antigen-specific targeting
functions are conserved, are of great interest as promising therapeutic proteins and there exists
a further need to develop analogous compositions for different clinical applications.

Non-natural therapeutic proteins are often particularly immunogenic. For example, Enbrel is a
fusion protein consisting of an extracellular domain of a Tumor Necrosis Factor Receptor
(TNF-R) fused to an Fc region of an antibody. About 16% of patients treated with Enbrel
have been reported to develop antibodies to this fusion protein (Physicians' Desk Reference
[2001] p. 3372). Similarly, a fusion protein consisting of erythropoietin (Epo) and granulocyte
/ macrophage-colony stimulating factor (GMCSF) was found to be highly immunogenic
(Coscarella A. et al. [1998] Cytokine 10:964-9; Coscarella A., Mol. Biotechnol. [1998] 10:115).
When injected into a primate, Epo-GMCSF fusion proteins were found to induce a strong
antibody response to the Epo moiety of the fusion protein, resulting in anemia.
Ceredase™ and Cerezyme™ are forms of the lysosomal enzyme glucocerebrosidase used to
treat Gaucher's disease; as a result of genetic engineering, glucocerebrosidase is attached to an
unusual high-mannose glycosylation. Patients with Gaucher's disease lack glucocerebrosidase
in their lysosomes, and as a result the patients' macrophages tend to accumulate lipids and
become foam cells (The Metabolic and Molecular Bases of Inherited Disease, 8th Edition
[2001] Scriver et al. eds. Chapter 146, "Gaucher Disease." p. 3635-3668). After
administration of Ceredase™ or Cerezyme™, the therapeutic protein is bound by mannose
receptors on macrophages, endocytosed, and trafficked through the endosomes to the
lysosome, which is its proper location. Patients treated with Ceredase often develop
antibodies to glucocerebrosidase (Pastores GM, et al., Blood [1993] 82:408-16; Physicians'
Desk Reference [2001] p. 1325-1326). Such antibodies can interfere with treatment (Brady
RO, et al., Pediatrics [1997] 100(6):E11). In a Phase I clinical trial using an antibody-cytokine
fusion protein, several patients developed antibody responses to the therapeutic
fusion protein. In this case, the antibody moiety was a humanized form of the 14.18 antibody,
and the cytokine was interleukin-2 (IL-2). Many of the reactive patients' sera included
significant levels of anti-idiotype antibodies.

Therapeutic use of a number of peptides, polypeptides and proteins is curtailed because of
their immunogenicity in mammals, especially humans. For example, when murine antibodies
are administered to patients who are not immunosuppressed, a majority of such patients
exhibit an immune reaction to the introduced foreign material by making human anti-murine
antibodies (HAMA) (e.g. Schroff, R. W. et al (1985) Cancer Res. 45: 879-885; Shawler, D.L.
et al (1985) J. Immunol. 135: 1530-1535). There are two serious consequences. First, the
patient's anti-murine antibody may bind and clear the therapeutic antibody or
immunoconjugate before it has a chance to bind, for example to a tumor, and perform its
therapeutic function. Second, the patient may develop an allergic sensitivity to the murine
antibody and be at risk of anaphylactic shock upon any future exposure to murine
immunoglobulin.

Several techniques have been employed to address the HAMA problem and thus enable the
use in humans of therapeutic monoclonal antibodies (see, for example, WO89/09622,
EP0239400, EP0438310, WO91/09967). These recombinant DNA approaches have generally
reduced the mouse genetic information in the final antibody construct whilst increasing the
human genetic information in the final construct. Notwithstanding, the resultant "humanized"
antibodies have, in several cases, still elicited an immune response in patients (Isaacs J.D.
(1990) Sem. Immunol. 2: 449-456; Rebello, P.R. et al (1999) Transplantation 68: 1417-1420).
A common aspect of these methodologies has been the introduction into the therapeutic
antibody, usually of rodent origin, of amino acid residues, even significant tracts of amino acid
residue sequences, identical to those present in human antibody proteins. For antibodies, this
process is possible owing to the relatively high degree of structural (and functional)
conservatism among antibody molecules of different species. For potentially therapeutic
peptides, polypeptides and proteins, however, where no structural homologue may exist in the
host species (e.g., human) for the therapeutic protein, such processes are not applicable.
Furthermore, these methods have assumed that the general introduction of a human amino acid
residue sequence will render the re-modeled antibody non-immunogenic. It is known,
however, that certain short peptide sequences ("T-cell epitopes") can be released during the
degradation of peptides, polypeptides or proteins within cells and subsequently be presented
by molecules of the major histocompatibility complex (MHC) in order to trigger the activation
of T-cells. For peptides presented by MHC Class II, such activation of T-cells can then give
rise to an antibody response by direct stimulation of B-cells to produce such antibodies.

Accordingly, it would be desirable to eliminate potential T-cell epitopes from a peptide,
polypeptide or a protein. Even proteins of human origin and with the same amino acid
sequences as occur within humans can still induce an immune response in humans. Notable
examples include therapeutic use of granulocyte-macrophage colony stimulating factor
(Wadhwa, M. et al (1999) Clin. Cancer Res. 5: 1353-1361) and interferon alpha 2 (Russo, D.
et al (1996) Brit. J. Haem. 94: 300-305; Stein, R. et al (1988) New Engl. J. Med. 318:
1409-1413).

During the last few years several techniques have been published which suggest solutions for
rendering antibodies and target proteins having different biological functions non- or at least
less immunogenic. Examples are: WO 92/10755 and WO 96/40792 (Novo Nordisk), EP 0519
596 (Merck & Co.), EP 0699 755 (Centro de Immunologia Molecular), WO 98/52976, WO
98/59244 and WO 00/34317 (Biovation Ltd.).

The general methods disclosed in the prior art regarding the elimination of T-cell epitopes
from proteins (e.g. WO 98/52976, WO 00/34317) comprise the following steps:
(a) Determining the amino acid sequence of the polypeptide or part thereof.
(b) Identifying one or more potential T-cell epitopes within the amino acid sequence of the
protein by any method including determination of the binding of the peptides to MHC
molecules using in vitro or in silico techniques or biological assays.
(c) Designing new sequence variants with one or more amino acids within the identified
potential T-cell epitopes modified in such a way as to substantially reduce or eliminate the
activity of the T-cell epitope as determined by the binding of the peptides to MHC
molecules using in vitro or in silico techniques or biological assays. Such sequence
variants are created in such a way as to avoid creation of new potential T-cell epitopes by the
sequence variations unless such new potential T-cell epitopes are, in turn, modified in such
a way as to substantially reduce or eliminate the activity of the T-cell epitope.
(d) Constructing such sequence variants by recombinant DNA techniques and testing said
variants in order to identify one or more variants with desirable properties.
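
Steps (a) to (c) can be sketched in code as follows. This is only a minimal illustration and not the method of the cited applications: the scoring function `toy_score`, the anchor positions, the hydrophobic residue set and the threshold are hypothetical placeholders for a real MHC Class II binding predictor or assay.

```python
from typing import Callable, List, Set, Tuple

HYDROPHOBIC = set("AVLIMFWY")      # assumption: crude hydrophobic residue set
ANCHORS = (0, 3, 5, 8)             # assumption: 9-mer positions facing groove pockets
AMINO_ACIDS = "ACDEFGHIKLMNPQRSTVWY"
Scorer = Callable[[str], int]


def toy_score(nonamer: str) -> int:
    """Hypothetical stand-in for a binding predictor: counts hydrophobic anchors."""
    return sum(nonamer[i] in HYDROPHOBIC for i in ANCHORS)


def find_epitopes(seq: str, score: Scorer = toy_score,
                  threshold: int = 3) -> List[Tuple[int, str]]:
    """Step (b): 0-based start and sequence of each 9-mer scoring >= threshold."""
    windows = ((i, seq[i:i + 9]) for i in range(len(seq) - 8))
    return [(i, w) for i, w in windows if score(w) >= threshold]


def propose_variants(seq: str, score: Scorer = toy_score,
                     threshold: int = 3) -> List[Tuple[int, str, str, str]]:
    """Step (c): single substitutions that remove epitopes without creating new ones."""
    before: Set[int] = {i for i, _ in find_epitopes(seq, score, threshold)}
    positions = sorted({p for start in before for p in range(start, start + 9)})
    variants = []
    for pos in positions:
        for aa in AMINO_ACIDS:
            if aa == seq[pos]:
                continue
            mutant = seq[:pos] + aa + seq[pos + 1:]
            after = {i for i, _ in find_epitopes(mutant, score, threshold)}
            if after < before:  # strict subset: fewer epitopes and no new ones
                variants.append((pos, seq[pos], aa, mutant))
    return variants
```

Each returned tuple holds the position, the original residue, the replacement and the mutant sequence. Step (d), construction and testing of the variants, remains an experimental step that such a screen can only help to narrow down.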


Other techniques exploiting soluble complexes of recombinant MHC molecules in
combination with synthetic peptides and able to bind to T-cell clones from peripheral blood
samples from human or experimental animal subjects have been used in the art [Kern, F. et al
(1998) Nature Medicine 4:975-978; Kwok, W.W. et al (2001) TRENDS in Immunology 22:
583-588] and may also be exploited in an epitope identification strategy.

Potential T-cell epitopes are generally defined as any amino acid residue sequence with
the ability to bind to MHC Class II molecules. Such potential T-cell epitopes can be measured
to establish MHC binding. In the general understanding, the term "T-cell epitope" denotes an
epitope which, when bound to MHC molecules, can be recognized by the T-cell receptor, and
which can, at least in principle, cause the activation of these T-cells. It is, however, usually
understood that certain peptides which are found to bind to MHC Class II molecules may be
retained in a protein sequence because such peptides are tolerated by the immune system
within the organism into which the final protein is administered.


The invention is conceived to overcome the practical reality that soluble proteins introduced
into an autologous host with therapeutic intent can trigger an immune response resulting in
development of host antibodies that bind to the soluble protein. One example amongst others
is interferon alpha 2, to which a proportion of human patients make antibodies despite the fact
that this protein is produced endogenously [Russo, D. et al (1996) Brit. J. Haem. 94: 300-305;
Stein, R. et al (1988) New Engl. J. Med. 318: 1409-1413].
MHC Class II molecules are a group of highly polymorphic proteins which play a central role
in helper T-cell selection and activation. The human leukocyte antigen group DR (HLA-DR)
are the predominant isotype of this group of proteins and the major focus of the present
invention. However, isotypes HLA-DQ and HLA-DP perform similar functions, hence the
present invention is equally applicable to these. MHC HLA-DR molecules are homo-dimers
where each "half" is a hetero-dimer consisting of α and β chains. Each hetero-dimer possesses
a ligand binding domain which binds to peptides varying between 9 and 20 amino acids in
length, although the binding groove can accommodate a maximum of 9 - 11 amino acids. The
ligand binding domain is comprised of amino acids 1 to 85 of the α chain, and amino acids
1 to 94 of the β chain. DQ molecules have recently been shown to have a homologous structure
and the DP family proteins are also expected to be very similar. In humans approximately 70
different allotypes of the DR isotype are known, for DQ there are 30 different allotypes and
for DP 47 different allotypes are known. Each individual bears two to four DR alleles, two
DQ and two DP alleles. The structure of a number of DR molecules has been solved and such
structures point to an open-ended peptide binding groove with a number of hydrophobic
pockets which engage hydrophobic residues (pocket residues) of the peptide [Brown et al
Nature (1993) 364: 33; Stern et al (1994) Nature 368: 215]. Polymorphism identifying the
different allotypes of class II molecule contributes to a wide diversity of different binding
surfaces for peptides within the peptide binding groove and at the population level ensures
maximal flexibility with regard to the ability to recognize foreign proteins and mount an
immune response to pathogenic organisms.
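
Because the groove is open-ended and holds a core of roughly nine residues, a longer peptide can in principle occupy it in several registers. The short sketch below enumerates these candidate cores; the fixed core length of 9 and the function name `binding_registers` are illustrative assumptions rather than part of the disclosure.

```python
from typing import List, Tuple


def binding_registers(peptide: str, core_len: int = 9) -> List[Tuple[int, str]]:
    """Return every contiguous core of length core_len with its 1-based start position."""
    if len(peptide) < core_len:
        return []  # too short to fill the assumed core
    last_start = len(peptide) - core_len
    return [(i + 1, peptide[i:i + core_len]) for i in range(last_start + 1)]
```

Under this assumption a peptide of 20 residues offers 12 possible registers, each of which would have to be assessed for fit into the pockets of a given allotype.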

There is a considerable amount of polymorphism within the ligand binding domain with
distinct "families" within different geographical populations and ethnic groups. This
polymorphism affects the binding characteristics of the peptide binding domain, thus different
"families" of DR molecules will have specificities for peptides with different sequence
properties, although there may be some overlap. This specificity determines recognition of
Th-cell epitopes (Class II T-cell response) which are ultimately responsible for driving the
antibody response to B-cell epitopes present on the same protein from which the Th-cell
epitope is derived. Thus, the immune response to a protein in an individual is heavily
influenced by T-cell epitope recognition, which is a function of the peptide binding specificity
of that individual's HLA-DR allotype. Therefore, in order to identify T-cell epitopes within a
protein or peptide in the context of a global population, it is desirable to consider the binding
properties of as diverse a set of HLA-DR allotypes as possible, thus covering as high a
percentage of the world population as possible.
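
One way to make the idea of broad allotype coverage concrete is to record, for each peptide, which allotypes in a panel are predicted to bind it, and to weight the result by how common each allotype is. The sketch below assumes a hypothetical predicate `binds(peptide, allotype)` and caller-supplied carrier frequencies; it further assumes, as a deliberate simplification, that carriage of different allotypes is statistically independent. None of the names or numbers used with it come from this document.

```python
from math import prod
from typing import Callable, Dict, List

Binds = Callable[[str, str], bool]


def binding_allotypes(peptide: str, panel: Dict[str, float], binds: Binds) -> List[str]:
    """Allotypes in the panel that the (hypothetical) predictor says bind the peptide."""
    return [allotype for allotype in panel if binds(peptide, allotype)]


def population_at_risk(peptide: str, carrier_freq: Dict[str, float], binds: Binds) -> float:
    """Fraction of individuals carrying at least one binding allotype.

    carrier_freq maps allotype name -> fraction of the population carrying it.
    Independence between allotypes is assumed, which real populations violate.
    """
    hits = binding_allotypes(peptide, carrier_freq, binds)
    return 1.0 - prod(1.0 - carrier_freq[a] for a in hits)
```

A peptide that binds only rare allotypes yields a small value, whereas a promiscuous binder approaches 1; widening the panel makes the estimate representative of more of the world population.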

A principal factor in the induction of an immune response is the presence within the protein of
peptides that can stimulate the activity of T-cells via presentation on MHC class II molecules.
In order to eliminate or reduce immunogenicity, it is thus desirable to identify and remove
T-cell epitopes from the protein.

According to the above-cited methods and related processes, several biological molecules,
basically usual target proteins and antibodies, have been prepared which show reduced
immunogenicity and allergenicity. Examples are: WO 99/55369 (SKB), WO 99/40198 and
WO 96/21016 (Leuven Research & Development VZW), WO 00/08196 (Duke University),
WO 96/21036 (Chiron Viragen), WO 97/31025 (Chiron Corp.), WO 98/30706 (Alliance
Pharmaceutical Corp.).

In all the applications cited above, single proteins or antibodies eliciting a lower immune
response were disclosed; there is no hint that fusion proteins, above all immunoglobulin fusion
proteins, were completely or partially de-immunized, especially by reducing the number of
T-cell epitopes within the sequence of said molecules by means of partially computational
methods. In WO 97/24137 (Tannox Biosystems Inc.) an IFNα-Fc chimer is disclosed which
contains a non-immunogenic linker molecule between the N-terminus of the Fc portion and
the C-terminus of IFNα.

Therefore, there is still a need to provide biological molecules, such as immunoconjugates,
which are not or are less immunogenic. Above all, it is of specific interest to provide
Fc-conjugates, preferably Fc-X chimers, wherein X is a selected protein or polypeptide of
therapeutic interest.


SUMMARY OF THE INVENTION 

The present invention relates to four general aspects:

(a) a novel application of the details of the immune response mechanism to situations
involving fusion proteins and other artificial proteins, to help determine when an engineered or
novel protein is likely to be immunogenic and therefore when application of a deimmunization
methodology is appropriate,

(b) novel biologically active artificial proteins to be administered especially to humans and in
particular for therapeutic use,

(c) a method of designing improved, less immunogenic artificial proteins that normally have
enhanced immunogenicity, the method comprising identification of one or more candidate
T-cell epitopes in the artificial protein and introducing a mutation that removes one or more
T-cell epitopes, and

(d) a convenient and effective computational method for the identification and calculation of
T-cell epitopes for a globally diverse number of MHC Class II molecules and, based on this
knowledge, for designing and constructing new sequence variants of biological molecules with
improved properties. Once T-cell epitopes have been identified in an artificial protein, they
are removed by mutation as described in (c).

Artificial proteins that have a component capable of binding to a surface receptor on a cell of
the immune system are, in general, particularly immunogenic. Artificial proteins that are
immunogenic as a consequence of having a moiety that binds to an immune cell surface
receptor are particularly good candidates for the methods of the invention for reducing
immunogenicity.

Without wishing to be bound by theory, Figures 1-6 present diagrams of artificial proteins
containing moieties that bind to immune cell surface receptors such as an Fc receptor, a
cytokine receptor, or an oligosaccharide receptor.

One method of the invention consists of the steps of identifying artificial proteins that contain
moieties that bind to an immune cell surface receptor, which may be done by sequence
inspection, identifying candidate T-cell epitopes in the artificial protein, designing mutant
derivatives of the artificial protein in which the number of T-cell epitopes is reduced,
producing one or more mutant derivatives, testing the mutant derivatives for activity and
optionally other desired properties, and choosing a mutant derivative that has an optimal
balance of reduced T-cell epitopes, retained activity, and optionally other retained desired
properties. Other desired properties may include, but are not limited to, pharmacokinetic
properties and protein expression and assembly characteristics.
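
The final selection step can be pictured as a filter followed by a ranking. In the sketch below, the `Candidate` record, the cut-offs for relative activity and expression, and the tie-breaking rule are all illustrative assumptions; the text does not prescribe how the "optimal balance" is to be weighed.

```python
from dataclasses import dataclass
from typing import List, Optional


@dataclass
class Candidate:
    name: str
    epitopes: int      # predicted T-cell epitopes remaining in the derivative
    activity: float    # biological activity relative to the parent (parent = 1.0)
    expression: float  # expression level relative to the parent (parent = 1.0)


def choose_candidate(candidates: List[Candidate],
                     min_activity: float = 0.8,
                     min_expression: float = 0.5) -> Optional[Candidate]:
    """Keep derivatives that retain enough activity and expression, then prefer
    the fewest remaining epitopes, breaking ties by higher activity."""
    viable = [c for c in candidates
              if c.activity >= min_activity and c.expression >= min_expression]
    if not viable:
        return None  # no derivative satisfies the assumed cut-offs
    return min(viable, key=lambda c: (c.epitopes, -c.activity))
```

In practice the thresholds would be set per product, and further measured properties such as pharmacokinetics could be added as extra fields and filter conditions.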

Artificial proteins that tend to form aggregates are a second category of proteins that can be
improved by the methods of the invention.
One class of artificial proteins that can particularly be improved by the invention are Ig fusion
proteins, such as fusion proteins comprising an entire antibody, as well as Fc-X and X-Fc
fusion proteins. In particular, immunoglobulin fusion proteins comprising a functional Fc
receptor binding site can be particularly improved by methods of the invention.






PCT/EP02/01690 

WO 02/066514 


The invention provides improved forms of such antibody fusion proteins, which include fusion
proteins comprising V regions that recognize tumor-specific antigens, other tissue-specific
antigens, or other disease-specific antigens. In one preferred embodiment, each of these
antibodies is fused to a cytokine, such as IL-2.

For example, the invention provides fusion proteins comprising the tumor-directed
anti-EpCAM antibody KS 1/4 and anti-GD2 antibody 14.18, in which the V regions of the
antibody contain mutations that remove T-cell epitopes.

In a distinct embodiment, the moiety that is fused to the antibody moiety is mutated such that
T-cell epitopes are removed. For example, the invention discloses an antibody-IL-2 fusion
protein in which the IL-2 moiety has been altered to remove T-cell epitopes.

A second general class of Ig fusion proteins that can be significantly improved are the Fc-X
and X-Fc fusion proteins. Without wishing to be bound by theory, it is thought that these
proteins are particularly immunogenic because the Fc receptor binding site, which is normally
somewhat sterically blocked by the light chain in an intact antibody, is exposed. In any case,
it has been empirically established that an Fc fusion protein can be more immunogenic than
the fusion partner by itself (WO01/07081).

Another class of immunogenic fusion proteins are proteins that are fused to a cytokine.
Without wishing to be bound by theory, it may be that these proteins are particularly
immunogenic because when the fusion partner protein binds to an immune cell, for example a
cell bearing an antibody that recognizes the fusion partner protein, the cytokine stimulates the
cell in some way (see Figure 4).






A class of artificial proteins that are particularly immunogenic are normal proteins that contain
an inappropriate oligosaccharide. For example, a protein containing an oligosaccharide that is
bound by a specific receptor on an immune cell is often found to be immunogenic. For
example, a protein, preferably a protein such as beta-glucocerebrosidase that can be used to
treat a lysosomal storage disorder, contains a high mannose oligosaccharide. Such an
immunogenic protein shows significantly reduced immunogenicity when modified according
to the invention.
The invention provides less immunogenic forms of the following protein moieties that are
incorporated into otherwise immunogenic fusion proteins, such as: erythropoietin, leptin,
keratinocyte growth factor, G-CSF, GM-CSF, IL-1R antagonist, sTNFR, TNF inhibitor,
sTNFR-Fc (Enbrel®), BDNF, CNTF, members of the interferon family, hGH,
β-glucocerebrosidase. All these biologically active protein moieties stated above derive from
well known non-modified (parent) protein moieties according to the invention.

The modified proteins according to the invention may be produced by the method indicated in
the section "Detailed Description of the Invention". The method includes a novel method for
identification of T-cell epitopes by computational means. This method step is preferred
according to the invention and described in more detail in EXAMPLE 1.

The invention discloses and claims as preferred embodiments of the invention altered or
modified fusion proteins derived from parent fusion proteins, said parent fusion protein
essentially consisting of an immunoglobulin molecule or a fragment thereof and a
non-immunoglobulin target polypeptide (X), which is linked preferably by its N-terminus to the
C-terminus of the immunoglobulin molecule or a fragment thereof, wherein the altered fusion
protein has an amino acid sequence different from that of said parent fusion protein and
exhibits reduced immunogenicity relative to the parent fusion protein when exposed to the
immune system of a given species, that is preferably human.

The strategies that are used in practice according to the invention to reduce the
immunogenicity of an immunogenic fusion protein are illustrated in detail for the
antibody-cytokine fusions. These general strategies include:

• Examining the amino acid sequences in the fusion protein and prioritising them with
respect to likely immunogenicity, based on the expected presence and abundance of the
sequences during negative selection of T-cells in the thymus. For example, completely
non-self epitopes are identified, and are the highest priority for removal of T-cell epitopes
by mutation. The lowest priority for removal of T-cell epitopes are sequences that are
present in abundant serum proteins, such as antibody constant regions or sequences that
are found in unrearranged antibody V regions. An intermediate priority for removal of
T-cell epitopes by mutation are self sequences that are found in low abundance proteins,
such as cytokines. Without wishing to be bound by theory, it is expected that low
abundance proteins may not be present in the thymus in high enough amounts to promote
negative T-cell selection, and may thus be recognized as non-self T-cell epitopes.
• When a region is chosen for removal of T-cell epitopes by mutation, it is compared with
naturally occurring human sequences found in abundant proteins. Mutations are
introduced to make any non-self sequences more similar to self sequences. For example,
to reduce the immunogenicity of a mouse V region, the sequence is compared to
unrearranged human V regions and the most closely related sequence is found. "Veneering"
changes are introduced, in which some amino acids are converted from mouse to human.
This has the effect of converting some non-self T-cell epitopes into self T-cell epitopes, a
method for reducing immunogenicity disclosed by US 5,712,120, and also has the effect of
removing some B cell epitopes. However, it is still necessary to remove T-cell epitopes
that derive from hypervariable region sequences.
• To remove most or all of the remaining T-cell epitopes, mutations are introduced that, by
the computer-based criteria defined above, prevent binding of a peptide into a groove of
an MHC Class II molecule. In the case of antibody V regions, it is preferable to introduce
mutations that lie outside the CDRs themselves, to avoid interfering with antigen binding.
• In the case of fusion proteins of any type, it is generally the case that the fusion junction
will contain non-self T-cell epitopes. These T-cell epitopes may also be removed by
mutation.
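
The prioritisation described in the list above can be expressed as a simple ranking. The sketch below is only an illustration: the three origin labels, the numeric priorities and the `spans_junction` helper are assumptions introduced here, and deciding whether a real sequence is self or non-self would require a comparison against human sequence data that is not shown.

```python
from dataclasses import dataclass
from typing import List

# Assumed ordering, following the text: 0 is removed first.
PRIORITY = {"non_self": 0, "self_low_abundance": 1, "self_abundant": 2}


@dataclass
class Epitope:
    start: int   # 0-based start of the 9-mer within the fusion protein
    origin: str  # one of the keys of PRIORITY


def spans_junction(start: int, junction: int, length: int = 9) -> bool:
    """True if the window covers residues on both sides of the fusion junction.

    junction is the 0-based index of the first residue of the second moiety.
    """
    return start < junction < start + length


def rank_for_removal(epitopes: List[Epitope], junction: int) -> List[Epitope]:
    """Sort epitopes so that the most urgent candidates for mutation come first.

    Windows that span the junction are treated as non-self regardless of label.
    """
    def key(e: Epitope):
        origin = "non_self" if spans_junction(e.start, junction) else e.origin
        return (PRIORITY[origin], e.start)
    return sorted(epitopes, key=key)
```

Treating every junction-spanning window as non-self mirrors the last list item; a finer treatment might first check whether the junction sequence happens to match a human sequence.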



As a specific embodiment the invention includes chimeric immunoglobulins or fragments
thereof wherein the reduced immunogenicity, the reduced number of T-cell epitopes or the
reduced number of peptides binding to MHC class II molecules is located in the target
polypeptide portion X as well as in the immunoglobulin portion or fragments thereof of the
altered fusion protein.

The invention also includes chimeric immunoglobulins as defined according to the invention
wherein the immunoglobulin molecule and the non-immunoglobulin target polypeptide (X)
are fused via a linker molecule (L). As a specific embodiment of the invention this linker
molecule itself has no or lower immunogenicity. Thus, the invention may include
immunoconjugates wherein the linker molecule alone is de-immunized. Linker molecules
which have reduced or no immunogenicity are known in the art or can be prepared by known
methods or by the method according to the invention. The invention also includes such
immunoglobulin fusion proteins, wherein the immunoglobulin portion as well as the target
protein (X) portion of the fusion molecule and optionally the linker molecule and the junction
region (see below) are immunogenicly modified. Alternatively, only one or more but not all
portions of the molecule are modified according to the invention.

The invention relates, furthermore, to above-said immunoconjugates which may derive, in 
principle, from all immunoglobulin classes; however, IgG is preferred. It is an object of the 
invention to provide such chimeric immunoglobulins which derive from IgG1, IgG2, IgG3 and 
IgG4. IgG1 and IgG2 immunoglobulins are preferred; IgG2 immunoglobulins are most 
preferred. 

Since it has been shown that even recombinant proteins of human origin and humanized 
antibodies may elicit an undesirable immune response in humans, it is an object of this 
invention to provide fusion proteins wherein the immunoglobulin portion as well as the target 
polypeptide portion (X) may be selected from non-human as well as from human origin. 
Since humanized or human-derived molecules have, as a rule, a smaller number of T-cell 
epitopes, such molecules are preferred for de-immunization, because a smaller number of 
amino acid residues has to be modified. 

Immunoconjugates (immunoglobulin (Ig) fusion proteins) according to the invention include 
also fragments of antibodies like sFv, Fab, Fab', F(ab')2 and Fc. It is a specific and preferred 
object of the invention to provide said above- and below-defined fusion proteins, wherein the 
immunoglobulin portion is a Fc domain of an antibody, preferably an IgG1 or IgG2 antibody. 

Fc-X molecules according to the invention which have reduced affinity to Fc receptors are a 
preferred object of the invention. Fc molecules having a reduced affinity to Fc receptors are 
well known in the art and can be prepared by modifying the amino acid sequence of the Fc 
domain (e.g. WO 99/43713). 



In detail, the invention refers to: 

• an immunogenically modified fusion protein derived from a parent fusion protein, 
essentially consisting of a first protein / polypeptide and a second protein / polypeptide, 
wherein the first protein is an immunoglobulin molecule or a fragment thereof and the 
second protein / polypeptide is a non-immunoglobulin target polypeptide (X), each linked to 
the other directly or by a linker molecule; said modified fusion protein having an amino 
acid sequence different from that of said parent fusion protein and exhibiting reduced 
immunogenicity by a reduced number of T-cell epitopes within its amino acid sequence 
relative to the parent fusion protein when exposed to the immune system of a given 
species; 

• a corresponding fusion protein, wherein said T-cell epitopes are peptide sequences able to 
bind to MHC class II molecule binding groups; 
• a corresponding fusion protein, wherein the target polypeptide (X) is linked by its 
N-terminal to the C-terminal of the immunoglobulin moiety; 
• a correspondingly modified fusion protein, wherein the given species is a human; 
• a corresponding fusion protein, wherein the fusion components are fused via a linker 
molecule L; 
• a modified fusion protein according to claim 4, wherein said linker molecule L is 
non-immunogenic or less immunogenic; 
• a corresponding fusion protein, wherein the junction region represented by the C-terminal 
region of the immunoglobulin portion and the N-terminal region of the 
non-immunoglobulin target polypeptide (X) has no or a reduced number of T-cell epitopes; 
• a corresponding fusion protein, wherein the immunoglobulin portion or a fragment thereof 
or the target polypeptide (X) portion is less immunogenic; 
• a corresponding fusion protein, wherein said immunoglobulin molecule or fragment 
thereof is IgG1 or IgG2; 
• a corresponding fusion protein, wherein said immunoglobulin fragment is a Fc portion, 
wherein, preferably, said Fc portion has a reduced affinity to Fc receptors; 
• an immunogenically modified fusion protein having the formula 

Fc - Ln - X 

wherein 
Fc is the Fc portion of an immunoglobulin molecule (antibody), 
X is a non-immunoglobulin target polypeptide, 
L is a linker peptide, 
n = 0 or 1, and 
wherein X and / or L comprises amino acid residue modifications which elicit a reduced 
immunogenicity compared to the parent molecule. 

Preferred embodiments of these immunogenically modified Fc fusion molecules are: 
Fc - Xm, wherein X is modified only, 
Fc - Lm - Xm, wherein X and L are modified to have a reduced immunogenicity, 
Fc - Xm, wherein X and the junction region between Fc and X are modified, 
Fc - Lm - Xm, wherein X and L and the junction regions between Fc, X and L are 
modified; 

• a corresponding Fc - (L) - X fusion protein wherein at least X is immunogenically 
modified; 

• an immunogenically modified fusion protein having the formula 

A - Ln - X 

wherein 
A is a whole antibody or its sFv, Fab, Fab', F(ab')2 fragments, 
X is a non-immunoglobulin target polypeptide, 
L is a linker peptide, 
n = 0 or 1, and 
wherein A and / or X and / or L comprise amino acid residue modifications which elicit a 
reduced immunogenicity compared to the parent molecule; 

Preferred embodiments of these immunogenically modified fusion molecules are: 
A - Xm, wherein X is modified only, optionally the A-X junction region, 
Am - Xm, wherein A and X are modified, optionally their junction region, 
A - Lm - Xm, wherein X and L are modified only to have a reduced immunogenicity, 
Am - X, wherein A has reduced immunogenicity only, optionally the A-X junction 
region, 
Am - Lm - Xm, wherein A, L and X are immunogenically modified, optionally the A-L-X 
junction regions; 

• a corresponding A - (L) - X fusion molecule, wherein at least X or A is immunogenically 
modified; 

• a corresponding fusion protein, wherein A is selected from the group: 
anti-EGF receptor (HER1) antibodies, 
anti-HER2 antibodies, 
anti-CDx antibodies, wherein x is an integer from 1 - 25, 
anti-cytokine receptor antibodies, 
anti-17-1A antibodies, 
anti-KSA antibodies, 
anti-GP IIb/IIIa antibodies, 
anti-integrin receptor antibodies, 
anti-VEGF receptor antibodies; 
• a corresponding fusion protein, wherein the antibody is selected from the group: 
monoclonal antibody 225 and derivatives, 
monoclonal antibody 425 and derivatives, 
monoclonal antibody KS 1/4 and derivatives, 
monoclonal antibody 14.18 and derivatives, 
monoclonal antibody 4D5 / HER2 (Herceptin®) and derivatives, 
monoclonal antibody 17-1A and derivatives, 
monoclonal anti-CD3 antibodies, 
monoclonal antibody 7E3 and derivatives, 
monoclonal antibodies LM609, P1F6 and 14D9.F8 and derivatives, 
monoclonal antibody DC-101 and derivatives, 
monoclonal anti-IL-2R antibody (Zenapax®) and derivatives; 
• a corresponding fusion protein, wherein the target polypeptide X is selected from the 
group: 
cytokines, integrin inhibitors, soluble cytokine receptors, glycoproteins, hormones, 
glycoprotein hormones, leptin, growth hormones, growth factors, antihemophilic factors, 
antigens, cytokine receptor antagonists; 
• a corresponding fusion protein, wherein the target polypeptide X is selected from the 
group: 
IL-2, G-CSF, GM-CSF, EPO, TPO, members of the interferon family, TNFα, soluble 
TNF receptor, IL-12, IL-8, factor VIII, FGF, TGF, EGF, VEGF, PMSA, IGF, insulin, 
RGD-peptides, endostatin, angiostatin, BDNF, CNTF, protein C, factor VII and IX, 
and biologically active fragments thereof; 

• a more specified corresponding fusion protein selected from the group: 
MAb KS 1/4 - IL2, MAb 14.18 - IL2, 
MAb 425 - IL2, MAb c425 - IL2, MAb h425 - IL2, MAb 425 - TNFα, 
MAb 225 - IL2, MAb c225 - IL2, 
MAb 4D5 - IL2, MAb DC101 - IL2, MAb LM609 - IL2, 
Fc - IL2, Fc - TNFα, Fc - G-CSF, Fc - EPO, Fc - Leptin, Fc - KGF, 
Fc - BDNF, Fc - β-Cerebrosidase, Fc - TPO, Fc - GM-CSF; 

• an immunogenically modified artificial protein selected from the group: 
(i) Y - (L) - X, wherein Y is a cytokine and X, (L) is a molecule as defined above, 
(ii) P - (L) - X, wherein P is a protein with unusual glycosylation moieties and X, (L) is 
a molecule as defined above, 
(iii) A - (L) - X, wherein A, X, (L) is a molecule as defined above, 
derived from a parent artificial protein, having an amino acid sequence which is different 
from that of said parent artificial protein and exhibiting reduced immunogenicity by a 
reduced number of T-cell epitopes relative to the parent fusion protein when exposed to 
the immune system of a given species, wherein said T-cell epitopes are peptide sequences 
able to bind to MHC class II molecule binding groups, obtainable or obtained by a method 
as specified in this invention; 

• a DNA sequence encoding any fusion protein as specified above and below; 

• a DNA sequence encoding a corresponding fusion protein, comprising 
(i) a signal sequence, 
(ii) a DNA sequence encoding all domains or a Fc, sFv, Fab, Fab' or F(ab')2 domain of an 
IgG1, IgG2 or IgG3 antibody, and 
(iii) a DNA sequence encoding the polypeptide (X), and optionally 
(iv) a DNA sequence encoding the linker molecule; 

• an expression vector comprising a corresponding DNA sequence; 

• a pharmaceutical composition comprising a fusion protein as specified above and below, 
optionally together with a suitable carrier, excipient or diluent or another therapeutically 
effective drug, such as chemotherapeutics or cytotoxic drugs; 

• a method for preparing an immunogenically modified fusion protein as specified, 
comprising the steps: 
(i) determining the amino acid sequence of the parent fusion protein or part thereof, 
(ii) identifying one or more potential T-cell epitopes within the amino acid sequence of 
the fusion protein by any method including determination of the binding of the peptides to 
MHC molecules using in vitro or in silico techniques or biological assays, 
(iii) designing new sequence variants by alteration of at least one amino acid residue within 
the originally identified T-cell epitope sequences, said variants being modified in such a way 
as to substantially reduce or eliminate the activity or number of the T-cell epitope sequences 
and / or the number of MHC allotypes able to bind peptides derived from said biological 
molecule as determined by the binding of the peptides to MHC molecules using in vitro or 
in silico techniques or biological assays or by binding of peptide-MHC complexes to 
T-cells, 
(iv) constructing such sequence variants by recombinant DNA techniques and 
testing said variants in order to identify one or more variants with desirable properties, 
and (v) optionally repeating steps (ii) - (iv), 
characterized in that the identification of T-cell epitope sequences according to step (ii) is 
achieved by 
(a) selecting a region of the peptide having a known amino acid residue sequence; 
(b) sequentially sampling overlapping amino acid residue segments of predetermined 
uniform size and constituted by at least three amino acid residues from the selected 
region; 
(c) calculating an MHC Class II molecule binding score for each said sampled 
segment by summing assigned values for each hydrophobic amino acid residue side chain 
present in said sampled amino acid residue segment; and 
(d) identifying at least one of said segments suitable for modification, based on the 
calculated MHC Class II molecule binding score for that segment, to change the overall 
MHC Class II binding score for the peptide without substantially reducing the therapeutic 
utility of the peptide; 

• a corresponding method, wherein step (c) is carried out by using a Böhm scoring function 
modified to include a 12-6 van der Waals ligand-protein energy repulsive term and a ligand 
conformational energy term by (1) providing a first data base of MHC Class II molecule 
models; (2) providing a second data base of allowed peptide backbones for said MHC 
Class II molecule models; (3) selecting a model from said first data base; (4) selecting an 
allowed peptide backbone from said second data base; (5) identifying amino acid residue 
side chains present in each sampled segment; (6) determining the binding affinity value 
for all side chains present in each sampled segment; and optionally (7) repeating steps (1) 
through (5) for each said model and each said backbone; 

• a corresponding method, wherein the sampled amino acid residue segment is constituted 
by 13 amino acid residues and / or consecutive sampled amino acid residue segments 
overlap by one to five amino acid residues; 

• a corresponding method, wherein 1-9 amino acid residues, preferably one amino acid 
residue, in any of the originally present T-cell epitope sequences is (are) altered; 

• a corresponding method, wherein the alteration of the amino acid residues is substitution, 
deletion or addition of originally present amino acid residue(s) by other amino acid 
residue(s) at specific position(s); 

• a corresponding method, wherein additionally further alteration by substitution, deletion 
or addition is conducted to restore biological activity of said biological molecule. 

The polypeptides according to the invention include also antigens, like PMSA and others. 
Antigens which elicit an undesired and too strong immune response can be modified 
according to the method of the invention and result in antigens which have a reduced 
immunogenicity which is, however, strong enough for using the antigen, e.g., as a vaccine. 
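
The segment sampling and scoring of steps (a) to (d) of the method above can be sketched as follows. This is a simplified illustration under stated assumptions: the per-residue values in HYDROPHOBIC_VALUE are placeholders (no assigned values are listed in this passage), the function names are hypothetical, and the plain summation stands in for the scoring of step (c) only, not for the modified Böhm scoring function.

```python
# Placeholder values for hydrophobic side chains (assumption: the text does
# not list the assigned values; aliphatic residues are weighted above aromatic).
HYDROPHOBIC_VALUE = {
    "L": 2.0, "I": 2.0, "V": 2.0, "M": 2.0,   # aliphatic
    "F": 1.0, "W": 1.0, "Y": 1.0,             # aromatic
}


def sample_segments(seq: str, size: int = 13, overlap: int = 5):
    """Step (b): sample overlapping segments of uniform size from the region."""
    if size < 3 or not 0 <= overlap < size:
        raise ValueError("size must be >= 3 and 0 <= overlap < size")
    step = size - overlap
    return [(i, seq[i:i + size]) for i in range(0, len(seq) - size + 1, step)]


def binding_score(segment: str) -> float:
    """Step (c): sum the assigned values of the hydrophobic residues."""
    return sum(HYDROPHOBIC_VALUE.get(res, 0.0) for res in segment)


def rank_segments(seq: str, size: int = 13, overlap: int = 5):
    """Step (d): order segments by score so the highest-scoring come first."""
    scored = [(binding_score(s), i, s) for i, s in sample_segments(seq, size, overlap)]
    return sorted(scored, reverse=True)


if __name__ == "__main__":
    for score, start, segment in rank_segments("LLLAAAFFF", size=3, overlap=0):
        print(start, segment, score)
```

Segments at the top of the ranking would be the first candidates for alteration, subject to the requirement that the therapeutic utility of the peptide is not substantially reduced.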

The invention includes also variants and other modifications of a specific polypeptide, protein, 
fusion protein, immunoglobulin or immunoconjugate which have in principle the same 
biological activity and a similar (reduced) immunogenicity. All proteins mentioned above are 
well known and described in the art or are already commercially available. Most of them are 
known to have a proven therapeutic benefit. The leader or signal sequences and linker 
sequences may be optional. 

Preparing the fusion protein by linking the immunoglobulin component or its fragment by its 
C-terminus to the N-terminus of the non-immunoglobulin target polypeptide (X), optionally 
via the linker molecule according to step (ii) as described above, is carried out by: 
(i) preparing a gene construct comprising a DNA sequence encoding the polypeptide X, a 
DNA sequence encoding the immunoglobulin molecule or fragments thereof [sFv, Fab, Fab', 
F(ab')2, Fc], and optionally the DNA sequence of the linker molecule, and 
(ii) expressing the gene construct by an expression system. 
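
Step (i) above can be pictured as joining the coding sequences in the order signal sequence, immunoglobulin portion, optional linker and target polypeptide X, while keeping a single reading frame. The sketch below is illustrative only: the function name, the toy sequences and the specific checks are assumptions, and it does not replace construct design by conventional recombinant techniques.

```python
STOP_CODONS = {"TAA", "TAG", "TGA"}


def assemble_construct(ig_dna: str, x_dna: str, linker_dna: str = "", signal_dna: str = "") -> str:
    """Join coding sequences as signal - immunoglobulin - linker - X, in frame.

    Raises ValueError if a part contains non-ACGT characters, has a length
    that is not a multiple of three, or if the joined sequence carries an
    in-frame stop codon before its final codon.
    """
    parts = [("signal", signal_dna), ("immunoglobulin", ig_dna),
             ("linker", linker_dna), ("X", x_dna)]
    construct = ""
    for name, dna in parts:
        dna = dna.upper()
        if set(dna) - set("ACGT"):
            raise ValueError(f"{name}: non-ACGT characters")
        if len(dna) % 3 != 0:
            raise ValueError(f"{name}: length is not a multiple of three")
        construct += dna
    codons = [construct[i:i + 3] for i in range(0, len(construct), 3)]
    if any(codon in STOP_CODONS for codon in codons[:-1]):
        raise ValueError("premature in-frame stop codon")
    return construct


if __name__ == "__main__":
    # Toy sequences, not real coding regions.
    print(assemble_construct("ATGGCC", "GGGTAA", linker_dna="GGTGGT"))
```

The order of the parts reflects the linkage described above, with the C-terminus of the immunoglobulin component joined to the N-terminus of X.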



The immunoconjugates according to the present invention reveal enhanced properties. Thus, 
decreased protein degradation, increased stability and enhanced serum circulation half-life can 
be measured as well as a distinctly reduced immunogenicity and / or allergenicity. 
Surprisingly, the reduced immunogenicity leads in many cases to a further increase of 
half-life, especially in cases where Fc-X molecules according to the invention are used. The 
reduced immunogenicity makes the fusion proteins according to the invention more tolerable 
for a given species compared to the non-modified fusion proteins, and they can therefore be 
administered in higher dosages if necessary. 

BRIEF DESCRIPTION OF THE DRAWINGS 
Figure 1 illustrates one of the mechanisms by which fusion proteins display enhanced 
immunogenicity. Figure 1a shows a protein ("X") fused to an Fc moiety binding to a cell 
bearing an Fc receptor. Figure 1b shows the fusion protein being processed such that the X 
moiety is preferentially degraded. Figure 1c shows a peptide remnant of "X" being presented 
by an MHC molecule to a T-cell. 

Figure 2 shows a mechanism of enhanced immunogenicity for a fusion protein 
comprising an Fc moiety and a second moiety. Figure 2a shows the binding of the fusion 
protein to a B cell that expresses an antibody specific for "X" on its surface. The fusion 
protein is bound by both the specific antibody and by Fc receptors that are not already bound 
by the antibody. 

Figure 3 illustrates a second mechanism by which fusion proteins display enhanced 
immunogenicity. Figure 3a shows a protein ("X") fused to a cytokine moiety binding to a B 
cell with a surface-bound antibody. Figure 3b shows the fusion protein being processed. 
Figure 3c shows a peptide remnant of "X" being presented by an MHC molecule to a T-cell, at 
the same time that additional X-cytokine fusion protein is bound to the surface of the B cell. 

Figure 4 illustrates another mechanism by which an engineered protein displays 
enhanced immunogenicity. In this case, a cytokine-X fusion protein directly activates a B cell. 
The B cell synthesizes a specific antibody to X, which increases the local concentration of the 
cytokine in the neighborhood of the B cell. 

Figure 5 illustrates another mechanism by which an engineered protein displays 
enhanced immunogenicity. Figure 5a shows the binding of a protein bearing a glycosylation 
moiety to a specific cell-surface receptor for that glycosylation moiety on an immune cell. 
Figure 5b shows the uptake and degradation of the glycosylated protein. Figure 5c shows the 
presentation of a peptide remnant of the glycosylated protein to a T-cell via an MHC 
molecule. 

Figure 6 shows a mechanism by which an antibody-cytokine fusion protein displays 
enhanced immunogenicity. Figure 6a shows the binding of the antibody-cytokine fusion 
protein to a B cell that expresses an antibody specific for the CDRs of the antibody-cytokine 
fusion protein. The fusion protein is bound by both the specific antibody and by Fc receptors 
that are not already bound by the antibody. Figure 6b shows the fusion protein being 
processed. Figure 6c shows a peptide remnant of the CDRs being presented by an MHC 
molecule to a T-cell, at the same time that additional antibody-cytokine fusion protein is 
bound to the surface of the B cell. 

DETAILED DESCRIPTION OF THE INVENTION 

The term "T-cell epitope" means according to the understanding of this invention an amino 
acid sequence which is able to bind with reasonable efficiency MHC class II molecules (or 
their equivalent in a non-human species), able to stimulate T-cells and / or also to bind 
(without necessarily measurably activating) T-cells in complex with MHC class II. 

The term "peptide" as used herein and in the appended claims, is a compound that includes 
two or more amino acids. The amino acids are linked together by a peptide bond (defined 
herein below). There are 20 different naturally occurring amino acids involved in the 
biological production of peptides, and any number of them may be linked in any order to form 
a peptide chain or ring. The naturally occurring amino acids employed in the biological 
production of peptides all have the L-configuration. Synthetic peptides can be prepared 
employing conventional synthetic methods, utilizing L-amino acids, D-amino acids, or various 
combinations of amino acids of the two different configurations. Some peptides contain only 
a few amino acid units. Short peptides, e.g., having less than ten amino acid units, are 
sometimes referred to as "oligopeptides". Other peptides contain a large number of amino 
acid residues, e.g. up to 100 or more, and are referred to as "polypeptides". By convention, a 
"polypeptide" may be considered as any peptide chain containing three or more amino acids, 
whereas an "oligopeptide" is usually considered as a particular type of "short" polypeptide. 
Thus, as used herein, it is understood that any reference to a "polypeptide" also includes an 
oligopeptide. Further, any reference to a "peptide" includes polypeptides, oligopeptides, and 
proteins. Each different arrangement of amino acids forms different polypeptides or proteins. 
The number of polypeptides, and hence the number of different proteins, that can be formed is 
practically unlimited. 

The term "less or reduced immunogenic(ity)" used before and thereafter is a relative term and 
relates to the immunogenicity of the respective original source molecule when exposed in vivo 
to the same type of species compared with the molecule modified according to the invention. 
The term "modified protein" as used according to this invention describes a protein which has 
a reduced number of T-cell epitopes and elicits therefore a reduced immunogenicity relative to 
the parent protein when exposed to the immune system of a given species. 
The term "non-modified protein" as used according to this invention describes the parent 
protein as compared to the "modified protein" and has a larger number of T-cell epitopes and, 
therefore, an enhanced immunogenicity relative to the modified protein when exposed to the 
immune system of a given species. 
The term "biologically active protein" as used here and in the claims includes according to the 
invention polypeptides, proteins, immunoglobulins such as antibodies, antibody fragments, 
fusion proteins, enzymes, antigens and so on, if not defined otherwise, which elicit a 
biological and / or therapeutic effect. 
The term "cytokine" is used herein to describe proteins, analogs thereof, and fragments thereof 
which are produced and excreted by a cell, and which elicit a specific response in a cell which 
has a receptor for that cytokine. Preferably, cytokines include interleukins such as 
interleukin-2 (IL-2), hematopoietic factors such as granulocyte-macrophage colony stimulating 
factor (GM-CSF), tumor necrosis factor (TNF) such as TNFα, and lymphokines such as 
lymphotoxin. Preferably, the antibody-cytokine fusion protein of the present invention 
displays cytokine biological activity. In principle, the invention encompasses all cytokines as 
recently classified according to their receptor code (Inglot, 1997, Archivum Immunologiae et 
Therapiae Experimentalis, 45: 353). 

The phrase "single chain Fv" or "scFv" refers to an antibody in which the heavy chain and the 
light chain of a traditional two chain antibody have been joined to form one chain. Typically, a 
linker peptide is inserted between the two chains to allow for proper folding and creation of an 
active binding site. 

The term "Fc region" or "Fc domain" as used in this invention is understood to mean the 
carboxyl terminal portion of an immunoglobulin heavy chain constant region, or an analog 
or portion thereof capable of binding an Fc receptor. As is known, each 
immunoglobulin heavy chain constant region comprises four or five domains. The domains are 
named sequentially as follows: CH1-hinge-CH2-CH3(-CH4). CH4 is present in IgM, which 
has no hinge region. The immunoglobulin heavy chain constant region useful in the practice of 
the invention preferably comprises an immunoglobulin hinge region, and preferably also 
includes a CH3 domain. The immunoglobulin heavy chain constant region most preferably 
comprises an immunoglobulin hinge region, a CH2 domain and a CH3 domain. The preferred 
Fc domain according to this invention consists thus of the hinge-CH2-CH3 domain. 
As used herein, the term immunoglobulin "hinge region" is understood to mean an entire 
immunoglobulin hinge region or at least a portion of the immunoglobulin hinge region 
sufficient to form one or more disulfide bonds with a second immunoglobulin hinge region. 
As used herein, the term "signal sequence" is understood to mean a segment which directs the 
secretion of the fusion protein and thereafter is cleaved following translation in the host cell. 
The signal sequence of the invention is a polynucleotide which encodes an amino acid 
sequence which initiates transport of a protein across the membrane of the endoplasmic 
reticulum. Signal sequences which are useful in the invention include antibody light chain 
signal sequences, e.g., antibody 14.18 (Gillies et al. (1989) J. Immunol. Meth. 125: 191) and 
any other signal sequences which are known in the art (see, e.g., Watson, 1984, Nucleic Acids 
Research 12:5145). 

The term "mutant or variant" used with respect to a particular protein encompasses any 
molecule such as a truncated or other derivative of the relevant protein which retains 
substantially the same activity in humans as the relevant protein. Such other derivatives can be 
prepared by the addition, deletion, substitution, or rearrangement of amino acids or by 
chemical modifications thereof. 

It is contemplated that suitable immunoglobulin heavy chain constant regions may be derived 
from antibodies belonging to each of the immunoglobulin classes referred to as IgA, IgD, IgE, 
IgG, and IgM; however, immunoglobulin heavy chain constant regions from the IgG class are 
preferred. 

Furthermore, it is contemplated that immunoglobulin heavy chain constant regions may be 
derived from any of the IgG antibody subclasses referred to in the art as IgG1, IgG2, IgG3, 
and IgG4. Immunoglobulin heavy chain constant region domains have cross-homology 
among the immunoglobulin classes. For example, the CH2 domain of IgG is homologous to 
the CH2 domain of IgA and IgD, and to the CH3 domain of IgM and IgE. The choice of 
appropriate immunoglobulin heavy chain constant regions is discussed in detail in US 
5,541,087 and 5,726,044. The choice of particular immunoglobulin heavy chain constant 
region sequences from certain immunoglobulin classes and subclasses to achieve a 
particular result is considered to be within the level of skill in the art. It may be useful, in some 
circumstances, to modify the immunoglobulin heavy chain constant region, for example, by 
mutation, deletion or other changes mediated by genetic engineering or other approaches, so 
that certain activities, such as complement fixation or stimulation of antibody-dependent 
cell-mediated cytotoxicity (ADCC) are reduced or eliminated. 

The Fc region is considered non- or weakly immunogenic if the immunoglobulin heavy chain 
constant region fails to generate a detectable antibody response. 

Furthermore, it is contemplated that substitution or deletion of amino acids within 
the immunoglobulin heavy chain constant regions may be useful in the practice of the 
invention. One example may include introducing amino acid substitutions in the upper CH2 
region to create a Fc variant with reduced affinity for Fc receptors (Cole et al. (1997) J. 
Immunol. 159:3613). An antibody-based fusion protein with an enhanced in vivo circulating 
half-life can be obtained by constructing a fusion protein having reduced binding affinity for 
a Fc receptor, and avoiding the use of sequences from antibody isotypes that bind to Fc 
receptors (WO 99/43713). For example, of the four known IgG isotypes, IgG1 (Cγ1) and IgG3 
(Cγ3) are known to bind FcRγI with high affinity, whereas IgG4 has a 10-fold lower binding 
affinity and IgG2 (Cγ2) does not bind to FcRγI. Thus, an antibody-based fusion protein with 
reduced binding affinity for a Fc receptor could be obtained by constructing a fusion protein 
with a Cγ2 constant region (Fc region) or a Cγ4 Fc region, and avoiding constructs with a Cγ1 
Fc region or a Cγ3 Fc region. An antibody-based fusion protein with an enhanced in vivo 
circulating half-life can be obtained by modifying sequences necessary for binding to Fc 
receptors in isotypes that have binding affinity for an Fc receptor, in order to reduce or 
eliminate binding. 
The important sequences for FcγR binding are Leu-Leu-Gly-Gly (residues 234 through 237 in 
Cγ1), located in the CH2 domain adjacent to the hinge (Canfield and Morrison, J. Exp. Med. 
173: 1483-1491 (1991)). Another important structural component necessary for effective FcR 
binding is the presence of an N-linked carbohydrate chain covalently bound to Asn297. 
Enzymatic removal of this structure or mutation of the Asn residue effectively abolishes, or at 
least dramatically reduces, binding to all classes of FcγR. 

The resulting antibody-based fusion proteins have a longer in vivo circulating half-life than the 
unlinked second non-immunoglobulin protein. Dimerization of a ligand can increase the 
apparent binding affinity between the ligand and its receptor. For instance, if one X moiety of 
an Fc-X fusion protein can bind to a receptor on a cell with a certain affinity, the second X 
moiety of the same Fc-Interferon-alpha fusion protein may bind to a second receptor on the 
same cell with a much higher avidity (apparent affinity). This may occur because of the 
physical proximity of the second X moiety to the receptor after the first X moiety already 
is bound. In the case of an antibody binding to an antigen, the apparent affinity may be 
increased by at least ten thousand-fold. Each protein subunit, i.e., "X," has its own 
independent function so that in a multivalent molecule, the functions of the protein subunits 
may be additive or synergistic. Thus, fusion of the normally dimeric Fc molecule or another 
antibody fragment to a polypeptide X may increase the activity of X. 

Nucleic acid sequences encoding, and amino acid sequences defining a 
human immunoglobulin Fc region, especially a Fcγ1, Fcγ2 and Fcγ3, useful in the practice of 
the invention are set forth in the prior art, such as disclosed in WO 00/40615, WO 00/69913, 
WO 00/24782, or in the Genbank and/or EMBL databases, for example, AF045536.1 
(Macaca fascicularis), AF045537.1 (Macaca mulatta), AB016710 (Felis catus), K00752 
(Oryctolagus cuniculus), U03780 (Sus scrofa), Z48947 (Camelus dromedarius), X62916 (Bos 
taurus), L07789 (Mustela vison), X69797 (Ovis aries), U17166 (Cricetulus migratorius), 
X07159 (Rattus rattus), AF57619.1 (Trichosurus vulpecula), or AF035795 (Monodelphis 
domestica). 

Thus, vectors reported earlier (Lo et al. (1998) Protein Engineering 11:495-500) were 
modified by replacing the human IgG1 Fc sequence with sequences from cDNA encoding the 
mouse IgG2a Fc (US 5,726,044). 

The invention encompasses mutations in the immunoglobulin component which eliminate 
undesirable properties of the native immunoglobulin, such as Fc receptor binding, and/or 
introduce desirable properties such as stability. For example, Angal S., King D.J., Bodmer 
M.W., Turner A., Lawson A.D.G., Roberts G., Pedley B. and Adair R., Molecular 
Immunology, vol. 30, pp. 105-108, 1993, describe an IgG4 molecule where residue 241 (Kabat 
numbering) is altered from serine to proline. This change increases the serum half-life of the 
IgG4 molecule. Canfield S.M. and Morrison S.L., Journal of Experimental 
Medicine, vol. 173, pp. 1483-1491, describe the alteration of residue 248 (Kabat numbering) 
from leucine to glutamate in IgG3 and from glutamate to leucine in mouse IgG2b. Replacing 
leucine with glutamate in the former decreases the affinity of the immunoglobulin molecule 
concerned for the FcγRI receptor, and replacing glutamate with leucine in the latter 
increases the affinity. EP 0307434 discloses various mutations including an L to E mutation at 
residue 248 (Kabat numbering) in IgG. The constant domain(s) or fragment thereof is 
preferably the whole or a substantial part of the constant region of the heavy chain of human 
IgG. The IgG component suitably comprises the CH2 and CH3 domains and the hinge region 
including cysteine residues contributing to inter-heavy chain disulphide bonding. For example, 
when the IgG component is derived from IgG4 it includes cysteine residues 8 and 11 of the 
IgG4 hinge region (Pink J.R. and Milstein C., Nature, vol. 216, pp. 941-942, 1967). 

The process of the invention may be performed by conventional recombinant techniques such
as described in Maniatis et al. (Molecular Cloning - A Laboratory Manual; Cold Spring
Harbor, 1982) and DNA Cloning Vols I, II and III (D.M. Glover ed., IRL Press Ltd) or
Sambrook et al. (1989, Molecular Cloning, A Laboratory Manual, Cold Spring Harbor
Laboratory Press, NY, USA).

In particular, the process may comprise the steps of:

(i) preparing a replicable expression vector capable, in a host cell, of expressing a DNA
polymer comprising a nucleotide sequence that encodes said compound;

(ii) transforming a host cell with said vector;

(iii) culturing said transformed host cell under conditions permitting expression of said DNA
polymer to produce said compound; and

(iv) recovering said compound.

The invention also provides a process for preparing the DNA polymer by the condensation of
appropriate mono-, di- or oligomeric nucleotide units. The preparation may be carried out
chemically, enzymatically, or by a combination of the two methods, in vitro or in vivo as
appropriate. Thus, the DNA polymer may be prepared by the enzymatic ligation of appropriate
DNA fragments, by conventional methods such as those described by D. M. Roberts et al. in
Biochemistry 1985, 24, 5090-5098. The DNA fragments may be obtained by digestion of
DNA containing the required sequences of nucleotides with appropriate restriction enzymes,
by chemical synthesis, by enzymatic polymerisation on DNA or RNA templates, or by a
combination of these methods. Digestion with restriction enzymes may be performed in an
appropriate buffer at a temperature of 20°-70°C with 0.1-10 µg DNA. Enzymatic
polymerisation of DNA may be carried out in vitro using a DNA polymerase such as DNA
polymerase I (Klenow fragment) in an appropriate buffer containing the nucleoside
triphosphates dATP, dCTP, dGTP and dTTP as required at a temperature of 10°-37°C,
generally in a volume of 50 µl or less. Enzymatic ligation of DNA fragments may be carried
out using a DNA ligase such as T4 DNA ligase in an appropriate buffer at a temperature of
4°C to ambient, generally in a volume of 50 µl or less.

The chemical synthesis of the DNA polymer or fragments may be carried out by conventional
phosphotriester, phosphite or phosphoramidite chemistry, using solid phase techniques such as
those described in "Chemical and Enzymatic Synthesis of Gene Fragments - A Laboratory
Manual" (ed. H.G. Gassen and A. Lang), Verlag Chemie, Weinheim (1982), or in other
scientific publications, for example M.J. Gait, H.W.D. Matthes, M. Singh, B.S. Sproat, and
R.C. Titmas, Nucleic Acids Research, 1982, 10, 6243; B.S. Sproat and W. Bannwarth,
Tetrahedron Letters, 1983, 24, 5771; M.D. Matteucci and M.H. Caruthers, Tetrahedron Letters,
1980, 21, 719; M.D. Matteucci and M.H. Caruthers, Journal of the American Chemical Society,
1981, 103, 3185; S.P. Adams et al., Journal of the American Chemical Society, 1983, 105, 661;
N.D. Sinha, J. Biernat, J. McMannus, and H. Koester, Nucleic Acids Research, 1984, 12, 4539;
and H.W.D. Matthes et al., EMBO Journal, 1984, 3, 801. Preferably an automated
DNA synthesizer is employed.

The DNA molecules may be obtained by the digestion with suitable restriction enzymes of
vectors carrying the required coding sequences or by use of polymerase chain reaction
technology. The precise structure of the DNA molecules and the way in which they
are obtained depend upon the structure of the desired product. The design of a
suitable strategy for the construction of the DNA molecule coding for the compound is a
routine matter for the skilled worker in the art.

The expression of the DNA polymer encoding the compound in a recombinant host cell may
be carried out by means of a replicable expression vector capable, in the host cell, of
expressing the DNA polymer. The expression vector is novel and also forms part of the
invention. The replicable expression vector may be prepared in accordance with the invention,
by cleaving a vector compatible with the host cell to provide a linear DNA segment having an
intact replicon, and combining said linear segment with one or more DNA molecules which,
together with said linear segment, encode the compound, under ligating conditions. The
ligation of the linear segment and more than one DNA molecule may be carried out
simultaneously or sequentially as desired. Thus, the DNA polymer may be preformed or
formed during the construction of the vector, as desired. A useful expression vector is
described in Lo et al. (1998) Protein Engineering 11:495, in which the transcription of the
Fc-X gene utilizes the enhancer/promoter of the human cytomegalovirus and the SV40
polyadenylation signal. Suitable vectors include plasmids, bacteriophages, cosmids and
recombinant viruses derived from, for example, baculoviruses, vaccinia or Semliki Forest
virus. Thus, vectors reported earlier (Lo et al. (1998) Protein Engineering 11:495-500) were
modified by replacing the human IgG1 Fc sequence with sequences from cDNA encoding the
mouse IgG2a Fc (US 5,726,044).

The choice of vector will be determined in part by the host cell, which may be prokaryotic,
such as E. coli, or eukaryotic, such as mouse C127, mouse myeloma, Chinese hamster ovary,
COS or HeLa cells, fungi, e.g. filamentous fungi or unicellular yeast, or an insect cell such as
Drosophila. Currently preferred host cells for use in the invention include immortal hybridoma
cells, NS/0 myeloma cells, 293 cells, Chinese hamster ovary cells, HeLa cells, and COS cells.
The host cell may also be a transgenic animal.

Polymerisation and ligation may be performed as described above for the preparation of the
DNA polymer. Digestion with restriction enzymes may be performed in an appropriate buffer
at a temperature of 20°-70°C with 0.1-10 µg DNA. The recombinant host cell is prepared, in
accordance with the invention, by transforming a host cell with a replicable expression vector
of the invention under transforming conditions. Suitable transforming conditions are
conventional and are described in, for example, Maniatis et al., cited above, or "DNA
Cloning" Vol. II, D.M. Glover ed., IRL Press Ltd, 1985. The choice of transforming
conditions is determined by the host cell. Thus, a bacterial host such as E. coli may be treated
with a solution of CaCl2 (Cohen et al., Proc. Nat. Acad. Sci., 1973, 69, 2110) or with a solution
comprising a mixture of RbCl, MnCl2, potassium acetate and glycerol, and then with 3-[N-
morpholino]-propane-sulphonic acid, RbCl and glycerol. Mammalian cells in culture may
be transformed by calcium co-precipitation of the vector DNA onto the cells. The invention
also extends to a host cell transformed or transfected with a replicable expression vector of the
invention. Culturing the transformed host cell under conditions permitting expression of
the DNA polymer is carried out conventionally, as described in, for example, Maniatis et
al. and "DNA Cloning" cited above. Thus, preferably the cell is supplied with nutrient
and cultured at a temperature below 45°C. The expression product is recovered by
conventional methods according to the host cell. Thus, where the host cell is bacterial, such as
E. coli, it may be lysed physically, chemically or enzymatically and the protein product may
be recovered from the periplasmic space or the nutrient medium. Where the host cell
is mammalian, the product may generally be isolated from the nutrient medium. The DNA
polymer may be assembled into vectors designed for isolation of stable transformed
mammalian cell lines expressing the product, e.g. bovine papillomavirus vectors or amplified
vectors in Chinese hamster ovary cells (DNA Cloning Vol. II, D.M. Glover ed., IRL Press 1985;
Kaufman, R.J., Molecular and Cellular Biology 5, 1750-1759, 1985; Pavlakis G.N. and
Hamer, D.H., Proceedings of the National Academy of Sciences (USA) 80, 397-401, 1983;
Goeddel, D.V. et al., and EP 0 093 619, 1983).

The immunoconjugates of the invention may comprise linker molecules. The linker
is preferably made up of amino acids linked together by peptide bonds. Thus, in preferred
embodiments, the linker is made up of from 1 to 20 amino acids linked by peptide bonds,
wherein the amino acids are selected from the 20 naturally occurring amino acids. Some of
these amino acids may be glycosylated, as is well understood by those in the art. In a
more preferred embodiment, the 1 to 20 amino acids are selected from glycine, alanine,
proline, asparagine, glutamine, and lysine. Even more preferably, a linker is made up of a
majority of amino acids that are sterically unhindered, such as glycine and alanine. Thus,
preferred linkers are polyglycines such as 
polyGly (particularly (Gly) 2 - (GlyM, 
poly(Gly-Ala),
polyAla.

Other specific examples of suitable linkers are: 

(Gly)3Lys(Gly)4
(Gly)3AsnGlySer(Gly)2
(Gly)3Cys(Gly)4 and
GlyProAsnGlyGly

Combinations of Gly and Ala are also preferred. The linkers shown here are exemplary;
linkers within the scope of this invention may be much longer and may include other
residues. Non-peptide linkers are also possible. The peptide linkers may be altered to
form derivatives in the same manner as described above.
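
For illustration only, the shorthand used for the exemplary linkers above can be expanded into
one-letter sequences and checked against the preferred length of 1 to 20 amino acids. The
following Python sketch is not part of the disclosed method; the function names
expand_linker and within_preferred_length and the reduced residue table are assumptions
introduced here.

```python
import re

# Three-letter to one-letter codes for the residues used in the exemplary
# linkers (illustrative subset, not the full set of 20 amino acids).
THREE_TO_ONE = {
    "Gly": "G", "Ala": "A", "Pro": "P", "Asn": "N",
    "Gln": "Q", "Lys": "K", "Ser": "S", "Cys": "C",
}

# Either a repeated residue such as "(Gly)3" or a single residue such as "Lys".
TOKEN = re.compile(r"\((?P<rep>[A-Z][a-z]{2})\)(?P<n>\d+)|(?P<single>[A-Z][a-z]{2})")


def expand_linker(notation):
    """Expand shorthand such as '(Gly)3Lys(Gly)4' into 'GGGKGGGG'."""
    compact = notation.replace(" ", "")
    parts = []
    pos = 0
    while pos < len(compact):
        match = TOKEN.match(compact, pos)
        if match is None:
            raise ValueError("cannot parse linker notation at: " + compact[pos:])
        if match.group("rep"):
            parts.append(THREE_TO_ONE[match.group("rep")] * int(match.group("n")))
        else:
            parts.append(THREE_TO_ONE[match.group("single")])
        pos = match.end()
    return "".join(parts)


def within_preferred_length(sequence, minimum=1, maximum=20):
    """True if the linker length lies in the preferred range of 1 to 20 residues."""
    return minimum <= len(sequence) <= maximum


for name in ("(Gly)3Lys(Gly)4", "(Gly)3AsnGlySer(Gly)2", "(Gly)3Cys(Gly)4", "GlyProAsnGlyGly"):
    seq = expand_linker(name)
    print(name, seq, within_preferred_length(seq))
```

The one-letter form is convenient when the linker is concatenated with the sequences of the
fusion partners for subsequent epitope analysis.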

Preferred linkers of the invention are non-immunogenic or less immunogenic. Most of the
above-cited linker peptides are at least less immunogenic. However, it is possible that, when
creating the linkage between an antibody or an sFv, Fab, Fab' or F(ab')2 or an Fc domain and
the target protein via a linker peptide molecule as mentioned above, new immunogenic epitopes
are created within the linkage region, resulting in an immunoconjugate which has an increased
immunogenicity compared to the immunogenicity of the single (de-immunized) components.
This situation also applies to fusion proteins having no linker molecule. Therefore, the
invention also relates to de-immunized regions of a fusion protein according to the invention,
the so-called fusion or junction regions. When fusing a first protein molecule with a second
protein molecule (which also can be a linker molecule) via the C- and N-terminals, a sequence
region is created that is artificial and, thus, has usually not yet been seen by the immune system.
This region is deemed to be immunogenic. According to the invention, the region comprises
approximately 10 amino acid residues of each protein terminal (N- or C-terminal). The
complete fusion region comprises, therefore, about 20 amino acid residues,
preferably 2 - 16, more preferably 2 - 10 (which is 1 - 8 and 1 - 5 amino acid residues,
respectively, of each fusion partner).
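
By way of a hedged illustration, the fusion or junction region described above can be
extracted from the sequences of the two fusion partners as follows. The function name
junction_region and the toy sequences are assumptions introduced here; the flank of 10
residues per partner corresponds to the approximate figure given above, and smaller flanks
(8 or 5) correspond to the preferred ranges.

```python
def junction_region(first_partner, second_partner, flank=10):
    """Return the junction sequence and the index of the fusion point within it.

    first_partner is the partner whose C-terminus is fused; second_partner is
    the partner (or linker) whose N-terminus is fused. Up to `flank` residues
    are taken from each side of the fusion point.
    """
    if flank < 1:
        raise ValueError("flank must be at least 1 residue")
    left = first_partner[-flank:]
    right = second_partner[:flank]
    return left + right, len(left)


# Toy sequences, purely illustrative (not taken from the specification).
region, fusion_point = junction_region("MKTAYIAKQRQISFVKSHFS", "GGGKGGGGAPTSSSTKKTQL")
print(region, fusion_point)
```

The returned region is the stretch that would be screened for newly created T-cell epitopes.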

The invention also includes further Fc variants. In such further Fc variants, one may remove one
or more sites of a native Fc that provide structural features or functional activity not required
by the fusion molecules of this invention. One may remove these sites by, for example,
substituting or deleting residues, inserting residues into the site, or truncating portions
containing the site. The inserted or substituted residues may also be altered amino acids, such
as peptidomimetics or D-amino acids. For example, one or more glycosylation sites may be
removed. Residues that are typically glycosylated (e.g., asparagine) may confer cytolytic
response. Such residues may be deleted or substituted with unglycosylated residues (e.g.,
alanine). ADCC sites as well as sites involved in interaction with complement, such as the C1q
binding site, may also be removed if there is a specific need.

The invention also includes derivatives of the target polypeptide (X) of the invention. Such
derivatives may further improve the solubility, absorption, biological half life, and the like of
(X). The modified (X) may alternatively eliminate or attenuate any undesirable side-effect and
the like. Exemplary derivatives also include compounds in which (X) or some portion thereof
is cyclic. For example, the peptide portion may be modified to contain two or more Cys
residues (e.g., in the linker), which could cyclize by disulfide bond formation. The compound
may also be cross-linked or rendered capable of cross-linking between molecules. For example,
the peptide portion may be modified to contain one Cys residue and thereby be able to form
an intermolecular disulfide bond with a like molecule.



In a final aspect the present invention relates to pharmaceutical compositions comprising said
biologically active proteins obtainable by the methods disclosed in the present invention, and
methods for therapeutic treatment of humans using the modified molecules and
pharmaceutical compositions.

Therapeutic compositions of the present invention contain a physiologically tolerable carrier
together with the relevant agent as described herein, dissolved or dispersed therein as an active
ingredient. As used herein, the terms "pharmaceutically acceptable", "physiologically
tolerable" and grammatical variations thereof, as they refer to compositions, carriers, diluents
and reagents, are used interchangeably and represent that the materials are capable of
administration to or upon a mammal without the production of undesirable physiological
effects such as nausea, dizziness, gastric upset and the like. The preparation of a
pharmacological composition that contains active ingredients dissolved or dispersed therein is
well understood in the art and need not be limited based on formulation. Typically, such
compositions are prepared as injectables either as liquid solutions or suspensions; however,
solid forms suitable for solution, or suspension, in liquid prior to use can also be prepared.
The preparation can also be emulsified. The active ingredient can be mixed with excipients
which are pharmaceutically acceptable and compatible with the active ingredient and in
amounts suitable for use in the therapeutic methods described herein. Suitable excipients are,
for example, water, saline, dextrose, glycerol, ethanol or the like and combinations thereof. In
addition, if desired, the composition can contain minor amounts of auxiliary substances such
as wetting or emulsifying agents, pH buffering agents and the like which enhance the
effectiveness of the active ingredient.

The therapeutic composition of the present invention can include pharmaceutically acceptable
salts of the components therein. Pharmaceutically acceptable salts include the acid addition
salts (formed with the free amino groups of the polypeptide) that are formed with inorganic
acids such as, for example, hydrochloric or phosphoric acids, or such organic acids as acetic,
tartaric, mandelic and the like. Salts formed with the free carboxyl groups can also be derived
from inorganic bases such as, for example, sodium, potassium, ammonium, calcium or ferric
hydroxides, and such organic bases as isopropylamine, trimethylamine, 2-ethylamino ethanol,
histidine, procaine and the like. Particularly preferred is the HCl salt when used in the
preparation of cyclic polypeptide αv antagonists. Physiologically tolerable carriers are well
known in the art. Exemplary of liquid carriers are sterile aqueous solutions that contain no
materials in addition to the active ingredients and water, or contain a buffer such as sodium
phosphate at physiological pH value, physiological saline or both, such as phosphate-buffered
saline. Still further, aqueous carriers can contain more than one buffer salt, as well as salts
such as sodium and potassium chlorides, dextrose, polyethylene glycol and other solutes.
Liquid compositions can also contain liquid phases in addition to and to the exclusion
of water. Exemplary of such additional liquid phases are glycerin, vegetable oils such
as cottonseed oil, and water-oil emulsions.

Typically, a therapeutically effective amount of a modified immunoglobulin in the form of a
modified antibody or antibody fragment according to the invention is an amount such that
when administered in a physiologically tolerable composition is sufficient to achieve a plasma
concentration of from about 0.01 microgram (µg) per milliliter (ml) to about 100 µg/ml,
preferably from about 1 µg/ml to about 5 µg/ml and usually about 5 µg/ml. Stated differently,
the dosage can vary from about 0.1 mg/kg to about 300 mg/kg, preferably from about 0.2
mg/kg to about 200 mg/kg, most preferably from about 0.5 mg/kg to about 20 mg/kg, in one or
more dose administrations daily for one or several days. Where the immunotherapeutic agent
is in the form of a fragment of a monoclonal antibody or a conjugate, the amount can readily
be adjusted based on the mass of the fragment / conjugate relative to the mass of the
whole antibody. A preferred plasma concentration in molarity is from about 2 micromolar
(µM) to about 5 millimolar (mM) and preferably, about 100 µM to 1 mM antibody antagonist.

A therapeutically effective amount of an agent according to this invention which is a non-
immunotherapeutic peptide or a protein is typically an amount of such a molecule such that
when administered in a physiologically tolerable composition is sufficient to achieve a plasma
concentration of from about 0.1 microgram (µg) per milliliter (ml) to about 200 µg/ml,
preferably from about 1 µg/ml to about 150 µg/ml. Based on a protein having a mass of about
500 grams per mole, the preferred plasma concentration in molarity is from about 2
micromolar (µM) to about 5 millimolar (mM) and preferably about 100 µM to 1 mM.
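
The relation between a mass concentration and a molar concentration used above can be
sketched as follows. This is a minimal illustration of the unit conversion only; the function
name ug_per_ml_to_micromolar is an assumption introduced here.

```python
def ug_per_ml_to_micromolar(conc_ug_per_ml, molar_mass_g_per_mol):
    """Convert a mass concentration (µg/ml) into a molar concentration (µM).

    1 µg/ml equals 1 mg/l; dividing by the molar mass in g/mol gives mmol/l,
    and multiplying by 1000 gives µmol/l (µM).
    """
    return conc_ug_per_ml / molar_mass_g_per_mol * 1000.0


# A plasma concentration of 1 µg/ml of a molecule of about 500 g/mol
# corresponds to about 2 µM.
print(ug_per_ml_to_micromolar(1.0, 500.0))
```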

The invention also encompasses treatment of a subject with agents that reduce or avoid side
effects associated with the combination therapy of the present invention ("adjunctive
therapy"), including, but not limited to, those agents, for example, that reduce the toxic effect
of anticancer drugs. Said adjunctive agents prevent or reduce the incidence of nausea and
vomiting associated with chemotherapy, radiotherapy or operation, or reduce the incidence of
infection associated with the administration of myelosuppressive anticancer drugs. Adjunctive
agents are well known in the art. The modified proteins according to the invention can
additionally be administered with adjuvants like BCG and immune system stimulators.

Furthermore, the compositions may include immunotherapeutic agents, chemotherapeutic
agents and anti-neoplastic agents which may contain cytotoxic effective radio labeled
isotopes, or other cytotoxic agents, such as cytotoxic peptides (e.g. cytokines) or cytotoxic
drugs and the like. The typical dosage of an active agent, which is preferably a chemical
antagonist or a (chemical) chemotherapeutic agent according to the invention (neither an
immunotherapeutic agent nor a non-immunotherapeutic peptide/protein), is 10 mg to 1000 mg,
preferably about 20 to 200 mg, and more preferably 50 to 100 mg per kilogram body weight
per day.



The following examples describe the invention in more detail. However, this listing does
not limit the invention.



EXAMPLE 1:

The following example describes in detail a preferred method for identification of
immunogenic sequence regions (T-cell epitopes) within the sequences of the fusion proteins as
disclosed in this invention. However, it should be pointed out that said molecules can be
obtained by other known methods.






The identification of T-cell epitopes of the molecules which were modified in order to obtain
the immunoconjugates according to the present invention can be achieved by different
methods which are described in the prior art (WO 92/10755 and WO 96/40792 (Novo
Nordisk), EP 0519 596 (Merck & Co.), EP 0699 755 (Centro de Inmunologia Molecular), WO
98/52976 and WO 98/59244 (Biovation Ltd.)) or related methods.
Advantageous immunoconjugates, however, can be obtained if the identification of said
epitopes is realized by the following new method, which is described herewith in detail:

There are a number of factors that play important roles in determining the total structure of a
protein, polypeptide or immunoglobulin. First, the peptide bond, i.e., that bond which joins the
amino acids in the chain together, is a covalent bond. This bond is planar in structure,
essentially a substituted amide. An "amide" is any of a group of organic compounds
containing the grouping -CONH-.

The planar peptide bond linking Cα of adjacent amino acids may be represented as depicted
below:



[Figure: schematic of the planar peptide bond and its amide plane]



Because the O=C and the C-N atoms lie in a relatively rigid plane, free rotation does not occur
about these axes. Hence, a plane schematically depicted by the interrupted line is sometimes
referred to as an "amide" or "peptide plane", wherein lie the oxygen (O), carbon (C),
nitrogen (N), and hydrogen (H) atoms of the peptide backbone. At opposite corners of this
amide plane are located the Cα atoms. Since there is substantially no rotation about the O=C
and C-N atoms in the peptide or amide plane, a polypeptide chain thus comprises a series of
planar peptide linkages joining the Cα atoms.

A second factor that plays an important role in defining the total structure or conformation of a
polypeptide or protein is the angle of rotation of each amide plane about the common Cα
linkage. The terms "angle of rotation" and "torsion angle" are hereinafter regarded as
equivalent terms. Assuming that the O, C, N, and H atoms remain in the amide plane (which
is usually a valid assumption, although there may be some slight deviations from planarity of
these atoms for some conformations), these angles of rotation define the N and R
polypeptide's backbone conformation, i.e., the structure as it exists between adjacent residues.
These two angles are known as φ and ψ. A set of the angles φi, ψi, where the subscript i
represents a particular residue of a polypeptide chain, thus effectively defines the polypeptide
secondary structure. The conventions used in defining the φ, ψ angles, i.e., the reference
points at which the amide planes form a zero degree angle, and the definition of which angle is
φ, and which angle is ψ, for a given polypeptide, are defined in the literature. See, e.g.,
Ramachandran et al., Adv. Prot. Chem. 23:283-437 (1968), at pages 285-94, which pages are
incorporated herein by reference.
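
As an illustration of the torsion angles discussed above, the following Python sketch
computes the torsion angle defined by four consecutive atom positions. It is a generic
textbook calculation and not a procedure taken from this specification; the function name
torsion_angle and the test coordinates are assumptions introduced here. Under the usual
convention, φ of residue i is obtained from the atoms C(i-1), N(i), Cα(i), C(i), and ψ from
N(i), Cα(i), C(i), N(i+1).

```python
import math


def _sub(a, b):
    return tuple(x - y for x, y in zip(a, b))


def _dot(a, b):
    return sum(x * y for x, y in zip(a, b))


def _cross(a, b):
    return (a[1] * b[2] - a[2] * b[1],
            a[2] * b[0] - a[0] * b[2],
            a[0] * b[1] - a[1] * b[0])


def torsion_angle(p0, p1, p2, p3):
    """Torsion angle in degrees about the p1-p2 bond, in the range (-180, 180]."""
    b0 = _sub(p0, p1)
    b1 = _sub(p2, p1)
    b2 = _sub(p3, p2)
    norm = math.sqrt(_dot(b1, b1))
    b1 = tuple(x / norm for x in b1)
    # Components of the outer bonds perpendicular to the central bond.
    v = _sub(b0, tuple(_dot(b0, b1) * x for x in b1))
    w = _sub(b2, tuple(_dot(b2, b1) * x for x in b1))
    x = _dot(v, w)
    y = _dot(_cross(b1, v), w)
    return math.degrees(math.atan2(y, x))


# Four points in one plane on the same side of the central bond (cis): 0 degrees.
print(torsion_angle((1, 0, 0), (0, 0, 0), (0, 0, 1), (1, 0, 1)))
```

A list of such (φ, ψ) pairs, one per residue, is the representation of backbone conformation
referred to in the paragraph above.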

The present method can be applied to any protein, and is based in part upon the discovery that
in humans the primary Pocket 1 anchor position of MHC Class II molecule binding grooves
has a well designed specificity for particular amino acid side chains. The specificity of this
pocket is determined by the identity of the amino acid at position 86 of the beta chain of the
MHC Class II molecule. This site is located at the bottom of Pocket 1 and determines the size
of the side chain that can be accommodated by this pocket. Marshall, K.W., J. Immunol.,
152:4946-4956 (1994). If this residue is a glycine, then all hydrophobic aliphatic and aromatic
amino acids (hydrophobic aliphatics being: valine, leucine, isoleucine, methionine and
aromatics being: phenylalanine, tyrosine and tryptophan) can be accommodated in the pocket,
a preference being for the aromatic side chains. If this pocket residue is a valine, then the side
chain of this amino acid protrudes into the pocket and restricts the size of peptide side chains
that can be accommodated such that only hydrophobic aliphatic side chains can be
accommodated. Therefore, in an amino acid residue sequence, wherever an amino acid with a
hydrophobic aliphatic or aromatic side chain is found, there is the potential for a MHC Class II
restricted T-cell epitope to be present. If the side-chain is hydrophobic aliphatic, however, it is
approximately twice as likely to be associated with a T-cell epitope than an aromatic side
chain (assuming an approximately even distribution of Pocket 1 types throughout the global
population).

A computational method embodying the present invention profiles the likelihood of peptide
regions to contain T-cell epitopes as follows:

(1) The primary sequence of a peptide segment of predetermined length is scanned, and all
hydrophobic aliphatic and aromatic side chains present are identified.

(2) The hydrophobic aliphatic side chains are assigned a value greater than that for the
aromatic side chains; preferably about twice the value assigned to the aromatic side chains,
e.g., a value of 2 for a hydrophobic aliphatic side chain and a value of 1 for an aromatic side
chain.

(3) The values determined to be present are summed for each overlapping amino acid residue
segment (window) of predetermined uniform length within the peptide, and the total value for a
particular segment (window) is assigned to a single amino acid residue at an intermediate
position of the segment (window), preferably to a residue at about the midpoint of the sampled
segment (window). This procedure is repeated for each sampled overlapping amino acid
residue segment (window). Thus, each amino acid residue of the peptide is assigned a value
that relates to the likelihood of a T-cell epitope being present in that particular segment
(window).

(4) The values calculated and assigned as described in Step 3, above, can be plotted
against the amino acid coordinates of the entire amino acid residue sequence being assessed.

(5) All portions of the sequence which have a score of a predetermined value, e.g., a value of
1, are deemed likely to contain a T-cell epitope and can be modified, if desired.
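
Steps (1) to (5) above can be sketched in Python as follows. This is a minimal illustration
under stated assumptions, not the definitive implementation: the window length of 9, the
function names, and the reading of "a score of a predetermined value" as "at or above a
threshold" are assumptions introduced here. The residue values (2 for hydrophobic aliphatic,
1 for aromatic) and the assignment of each window total to the residue at about the midpoint
follow the text.

```python
ALIPHATIC = set("VLIM")   # valine, leucine, isoleucine, methionine
AROMATIC = set("FYW")     # phenylalanine, tyrosine, tryptophan


def residue_value(residue):
    """Steps (1)-(2): value 2 for hydrophobic aliphatic, 1 for aromatic, else 0."""
    if residue in ALIPHATIC:
        return 2
    if residue in AROMATIC:
        return 1
    return 0


def epitope_profile(sequence, window=9):
    """Step (3): sum the values over each overlapping window and assign the
    total to the residue at about the window midpoint.

    Residues too close to either terminus to be the midpoint of a full
    window keep a value of 0 in this sketch.
    """
    values = [residue_value(r) for r in sequence.upper()]
    profile = [0] * len(values)
    half = window // 2
    for start in range(len(values) - window + 1):
        profile[start + half] = sum(values[start:start + window])
    return profile


def flagged_positions(profile, threshold=1):
    """Step (5): positions whose score is at or above the chosen threshold."""
    return [i for i, score in enumerate(profile) if score >= threshold]


profile = epitope_profile("GGVGG", window=3)
print(profile)                      # step (4) would plot these values
print(flagged_positions(profile))
```

For step (4), the returned profile can be plotted against residue number with any plotting
tool; regions of high value are candidates for modification.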

This particular aspect of the present invention provides a general method by which the regions
of peptides likely to contain T-cell epitopes can be described. Modifications to the peptide in
these regions have the potential to modify the MHC Class II binding characteristics.

According to another aspect of the present invention, T-cell epitopes can be predicted with
greater accuracy by the use of a more sophisticated computational method which takes into
account the interactions of peptides with models of MHC Class II alleles.

The computational prediction of T-cell epitopes present within a peptide according to this
particular aspect contemplates the construction of models of at least 42 MHC Class II alleles
based upon the structures of all known MHC Class II molecules and a method for the use of
these models in the computational identification of T-cell epitopes, the construction of
libraries of peptide backbones for each model in order to allow for the known variability in
relative peptide backbone alpha carbon (Cα) positions, the construction of libraries of amino-
acid side chain conformations for each backbone docked with each model for each of the 20
amino-acid alternatives at positions critical for the interaction between peptide and MHC
Class II molecule, and the use of these libraries of backbones and side-chain conformations in
conjunction with a scoring function to select the optimum backbone and side-chain
conformation for a particular peptide docked with a particular MHC Class II molecule and the
derivation of a binding score from this interaction.

Models of MHC Class II molecules can be derived via homology modeling from a number of
similar structures found in the Brookhaven Protein Data Bank ("PDB"). These may be made
by the use of semi-automatic homology modeling software (Modeller, Sali A. & Blundell T.L.,
1993, J. Mol. Biol. 234:779-815) which incorporates a simulated annealing function, in
conjunction with the CHARMm force-field for energy minimisation (available from
Molecular Simulations Inc., San Diego, Ca.). Alternative modeling methods can be utilized as
well.

The present method differs significantly from other computational methods which use libraries
of experimentally derived binding data of each amino-acid alternative at each position in the
binding groove for a small set of MHC Class II molecules (Marshall, K.W., et al., Biomed.
Pept. Proteins Nucleic Acids, 1(3):157-162 (1995)) or yet other computational methods which
use similar experimental binding data in order to define the binding characteristics of
particular types of binding pockets within the groove, again using a relatively small subset of
MHC Class II molecules, and then 'mixing and matching' pocket types from this pocket
library to artificially create further 'virtual' MHC Class II molecules (Sturniolo T., et al., Nat.
Biotech. 17(6):555-561 (1999)). Both prior methods suffer the major disadvantage that, due to
the complexity of the assays and the need to synthesize large numbers of peptide variants, only
a small number of MHC Class II molecules can be experimentally scanned. Therefore the first
prior method can only make predictions for a small number of MHC Class II molecules. The
second prior method also makes the assumption that a pocket lined with similar amino-acids in
one molecule will have the same binding characteristics when in the context of a different
Class II allele and suffers further disadvantages in that only those MHC Class II molecules can
be 'virtually' created which contain pockets contained within the pocket library. Using the
modeling approach described herein, the structure of any number and type of MHC Class II
molecules can be deduced, therefore alleles can be specifically selected to be representative of
the global population. In addition, the number of MHC Class II molecules scanned can be
increased by making further models rather than having to generate additional data via
complex experimentation.

The use of a backbone library allows for variation in the positions of the Cα atoms of the
various peptides being scanned when docked with particular MHC Class II molecules. This is
again in contrast to the alternative prior art computational methods described above which rely
on the use of simplified peptide backbones for scanning amino-acid binding in particular pockets.
These simplified backbones are not likely to be representative of backbone conformations
found in 'real' peptides, leading to inaccuracies in prediction of peptide binding. The present
backbone library is created by superposing the backbones of all peptides bound to MHC Class
II molecules found within the Protein Data Bank and noting the root mean square (RMS)
deviation between the Cα atoms of each of the eleven amino-acids located within the binding
groove. While this library can be derived from a small number of suitable available mouse
and human structures (currently 13), in order to allow for the possibility of even greater
variability, the RMS figure for each Cα position is increased by 50%. The average Cα
position of each amino-acid is then determined and a sphere drawn around this point whose
radius equals the RMS deviation at that position plus 50%. This sphere represents all allowed
Cα positions.

Working from the Cα with the least RMS deviation (that of the amino-acid in Pocket 1 as
mentioned above, equivalent to Position 2 of the 11 residues in the binding groove), the sphere
is three-dimensionally gridded, and each vertex within the grid is then used as a possible
location for a Cα of that amino-acid. The subsequent amide plane, corresponding to the
peptide bond to the subsequent amino-acid, is grafted onto each of these Cαs and the φ and ψ
angles are rotated step-wise at set intervals in order to position the subsequent Cα. If the
subsequent Cα falls within the 'sphere of allowed positions' for this Cα then the orientation of
the dipeptide is accepted, whereas if it falls outside the sphere then the dipeptide is rejected.
This process is then repeated for each of the subsequent Cα positions, such that the peptide
grows from the Pocket 1 Cα 'seed', until all nine subsequent Cαs have been positioned from
all possible permutations of the preceding Cαs. The process is then repeated once more for
the single Cα preceding Pocket 1 to create a library of backbone Cα positions located within
the binding groove.
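
The sphere construction and the accept/reject growth step described above can be sketched
as follows. This is a minimal illustration under stated assumptions, not the implementation
used for the invention: the RMS deviation is taken here as the RMS distance of the superposed
Cα atoms from their mean position, and the function names and the `propose_next` callback
(a stand-in for the step-wise φ/ψ rotation of the amide plane) are hypothetical.

```python
import math

def allowed_sphere(ca_positions, inflate=0.5):
    """Return (centre, radius) of the sphere of allowed C-alpha positions.

    ca_positions holds the (x, y, z) coordinates of one groove position,
    one entry per superposed structure. The radius is the RMS distance
    from the mean position (an assumed reading of "RMS deviation"),
    enlarged by `inflate` (50% in the text).
    """
    n = len(ca_positions)
    centre = tuple(sum(p[i] for p in ca_positions) / n for i in range(3))
    msd = sum(math.dist(p, centre) ** 2 for p in ca_positions) / n
    return centre, math.sqrt(msd) * (1.0 + inflate)

def grid_sphere(centre, radius, step):
    """Vertices of a cubic grid with spacing `step` that lie inside the sphere."""
    n = int(radius // step)
    points = []
    for i in range(-n, n + 1):
        for j in range(-n, n + 1):
            for k in range(-n, n + 1):
                p = (centre[0] + i * step, centre[1] + j * step, centre[2] + k * step)
                if math.dist(p, centre) <= radius:
                    points.append(p)
    return points

def extend(backbones, propose_next, centre, radius):
    """Grow each partial backbone by one C-alpha.

    propose_next(last_ca) is a hypothetical generator of candidate positions
    for the next C-alpha; only candidates inside the next sphere are kept.
    """
    grown = []
    for backbone in backbones:
        for ca in propose_next(backbone[-1]):
            if math.dist(ca, centre) <= radius:
                grown.append(backbone + [ca])
    return grown
```

Seeding `extend` with the vertices returned by `grid_sphere` for the Pocket 1 sphere and
calling it once per subsequent groove position reproduces the growth scheme in outline; a finer
`step` or a finer angular sampling inside `propose_next` enlarges the resulting library.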

The number of backbones generated is dependent upon several factors: the size of the
'spheres of allowed positions'; the fineness of the gridding of the 'primary sphere' at the
Pocket 1 position; and the fineness of the step-wise rotation of the φ and ψ angles used to
position subsequent Cαs. Using this process, a large library of backbones can be created. The
larger the backbone library, the more likely it will be that the optimum fit will be found for a
particular peptide within the binding groove of an MHC Class II molecule. Inasmuch as not all
backbones will be suitable for docking with all the models of MHC Class II molecules due
to clashes with amino-acids of the binding domains, for each allele a subset of the library is
created comprising backbones which can be accommodated by that allele. The use of the
backbone library, in conjunction with the models of MHC Class II molecules, creates an
exhaustive database consisting of allowed side-chain conformations for each amino-acid in
each position of the binding groove for each MHC Class II molecule docked with each
allowed backbone. This data set is generated using a simple steric overlap function where an
MHC Class II molecule is docked with a backbone and an amino-acid side chain is grafted
onto the backbone at the desired position. Each of the rotatable bonds of the side chain is
rotated step-wise at set intervals and the resultant positions of the atoms dependent upon that
bond noted. The interaction of the atom with atoms of side-chains of the binding groove is
noted and positions are either accepted or rejected according to the following criterion: the sum
total of the overlap of all atoms so far positioned must not exceed a pre-determined value.



PCT/EP02/01690 

WO 02/066514 

Thus the stringency of the conformational search is a function of the interval used in the
step-wise rotation of the bond and the pre-determined limit for the total overlap. This latter
value can be small if it is known that a particular pocket is rigid; however, the stringency can be
relaxed if the positions of pocket side-chains are known to be relatively flexible. Thus
allowances can be made to imitate variations in flexibility within pockets of the binding
groove. This conformational search is then repeated for every amino-acid at every position of
each backbone when docked with each of the MHC Class II molecules to create the exhaustive
database of side-chain conformations.
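
The conformational search can be illustrated with a short sketch. The text does not define how
the overlap of two atoms is measured, so the sketch assumes a simple sphere overlap (sum of
radii minus interatomic distance, floored at zero); `build_atoms`, which would place the
side-chain atoms for a given set of torsion angles, is a hypothetical callback.

```python
import itertools
import math

def total_overlap(ligand_atoms, pocket_atoms):
    """Sum of pairwise sphere overlaps; each atom is ((x, y, z), radius).

    Assumed overlap measure: max(0, r_a + r_b - distance).
    """
    total = 0.0
    for p, rp in ligand_atoms:
        for q, rq in pocket_atoms:
            total += max(0.0, rp + rq - math.dist(p, q))
    return total

def torsion_grid(n_bonds, step_deg):
    """All combinations of torsion angles sampled every `step_deg` degrees."""
    return itertools.product(range(0, 360, step_deg), repeat=n_bonds)

def accepted_conformations(n_bonds, step_deg, build_atoms, pocket_atoms, limit):
    """Keep the torsion settings whose total overlap does not exceed `limit`."""
    return [
        angles
        for angles in torsion_grid(n_bonds, step_deg)
        if total_overlap(build_atoms(angles), pocket_atoms) <= limit
    ]
```

A smaller `step_deg` or a smaller `limit` makes the search more stringent, which corresponds
to the two controls named in the text; a larger `limit` would model a more flexible pocket.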

A suitable mathematical expression is used to estimate the energy of binding between models
of MHC Class II molecules in conjunction with peptide ligand conformations which have to
be empirically derived by scanning the large database of backbone/side-chain conformations
described above. Thus a protein is scanned for potential T-cell epitopes by subjecting each
possible peptide of length varying between 9 and 20 amino-acids (although the length is kept
constant for each scan) to the following computations: An MHC Class II molecule is selected
together with a peptide backbone allowed for that molecule and the side-chains corresponding
to the desired peptide sequence are grafted on. Atom identity and interatomic distance data
relating to a particular side-chain at a particular position on the backbone are collected for
each allowed conformation of that amino-acid (obtained from the database described above).
This is repeated for each side-chain along the backbone and peptide scores derived using a
scoring function. The best score for that backbone is retained and the process repeated for each
allowed backbone for the selected model. The scores from all allowed backbones are
compared and the highest score is deemed to be the peptide score for the desired peptide in
that MHC Class II model. This process is then repeated for each model with every possible
peptide derived from the protein being scanned, and the scores for peptides versus models are
displayed.
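
The scanning loop described above can be sketched as follows. The `score` callable is a
hypothetical placeholder for the per-backbone score (already maximised over side-chain
conformations), and the dictionary layout of `models` is likewise an assumption made for
illustration.

```python
def windows(sequence, length):
    """Every contiguous peptide of the given length, with its start index."""
    return [(i, sequence[i:i + length]) for i in range(len(sequence) - length + 1)]

def peptide_score(peptide, backbones, score):
    """Highest score over all backbones allowed for one MHC Class II model."""
    return max(score(peptide, backbone) for backbone in backbones)

def scan(sequence, length, models, score):
    """Map (model name, start index, peptide) to the peptide score.

    models maps a model name to the list of backbones allowed for it.
    The peptide length is held constant for the whole scan, as in the text.
    """
    table = {}
    for name, backbones in models.items():
        for start, peptide in windows(sequence, length):
            table[(name, start, peptide)] = peptide_score(peptide, backbones, score)
    return table
```

For example, `windows("VPIQKVQDDTKTLIKT", 13)` yields four overlapping 13-mers beginning
with `VPIQKVQDDTKTL`.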

In the context of the present invention, each ligand presented for the binding affinity
calculation is an amino-acid segment selected from a peptide or protein as discussed above.
Thus, the ligand is a selected stretch of amino acids about 9 to 20 amino acids in length
derived from a peptide, polypeptide or protein of known sequence. The terms "amino acids"
and "residues" are hereinafter regarded as equivalent terms. The ligand, in the form of the
consecutive amino acids of the peptide to be examined grafted onto a backbone from the
backbone library, is positioned in the binding cleft of an MHC Class II molecule from the
MHC Class II molecule model library via the coordinates of the Cα atoms of the peptide
backbone and an allowed conformation for each side-chain is selected from the database of
allowed conformations. The relevant atom identities and interatomic distances are also
retrieved from this database and used to calculate the peptide binding score. Ligands with a
high binding affinity for the MHC Class II binding pocket are flagged as candidates for
site-directed mutagenesis. Amino-acid substitutions are made in the flagged ligand (and hence in
the protein of interest) which is then retested using the scoring function in order to determine
changes which reduce the binding affinity below a predetermined threshold value. These
changes can then be incorporated into the protein of interest to remove T-cell epitopes.
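
The flag-and-retest step can be sketched as an exhaustive single-substitution scan. This is
only an illustration: `score` is a hypothetical callable returning the peptide score (higher
meaning stronger predicted binding, as in the text), and restricting the search to single
substitutions among the twenty standard residues is an assumption.

```python
AMINO_ACIDS = "ACDEFGHIKLMNPQRSTVWY"

def single_substitutions(peptide):
    """Yield (position, new residue, mutant peptide) for every single change."""
    for i, wild_type in enumerate(peptide):
        for aa in AMINO_ACIDS:
            if aa != wild_type:
                yield i, aa, peptide[:i] + aa + peptide[i + 1:]

def candidate_substitutions(peptide, score, threshold):
    """Substitutions whose rescored mutant falls below the threshold.

    Returns (position, wild-type residue, replacement residue) tuples.
    """
    return [
        (i, peptide[i], aa)
        for i, aa, mutant in single_substitutions(peptide)
        if score(mutant) < threshold
    ]
```

Whether a returned substitution also preserves biological activity is outside the scope of
this sketch and would have to be checked separately.
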
Binding between the peptide ligand and the binding groove of MHC Class II molecules
involves non-covalent interactions including, but not limited to: hydrogen bonds, electrostatic
interactions, hydrophobic (lipophilic) interactions and van der Waals interactions. These are
included in the peptide scoring function as described in detail below. It should be understood
that a hydrogen bond is a non-covalent bond which can be formed between polar or charged
groups and consists of a hydrogen atom shared by two other atoms. The hydrogen of the
hydrogen donor has a partial positive charge whereas the hydrogen acceptor has a partial
negative charge. For the purposes of peptide/protein interactions, hydrogen bond donors may be
either nitrogens with hydrogen attached or hydrogens attached to oxygen or nitrogen. Hydrogen
bond acceptor atoms may be oxygens not attached to hydrogen, nitrogens with no hydrogens
attached and one or two connections, or sulphurs with only one connection. Certain atoms,
such as oxygens attached to hydrogens or imine nitrogens (e.g. C=NH), may be either hydrogen
acceptors or donors. Hydrogen bond energies range from 3 to 7 kcal/mol and are much
stronger than van der Waals bonds, but weaker than covalent bonds. Hydrogen bonds are
also highly directional and are at their strongest when the donor atom, hydrogen atom and
acceptor atom are co-linear. Electrostatic bonds are formed between oppositely charged ion
pairs and the strength of the interaction is inversely proportional to the square of the distance
between the atoms according to Coulomb's law. The optimal distance between ion pairs is
about 2.8 Å. In protein/peptide interactions, electrostatic bonds may be formed between
arginine, histidine or lysine and aspartate or glutamate. The strength of the bond will depend
upon the pKa of the ionizing group and the dielectric constant of the medium, although they
are approximately similar in strength to hydrogen bonds.

Lipophilic interactions are favorable hydrophobic-hydrophobic contacts that occur between the
protein and peptide ligand. Usually, these will occur between hydrophobic amino acid side
chains of the peptide buried within the pockets of the binding groove such that they are not
exposed to solvent. Exposure of the hydrophobic residues to solvent is highly unfavorable
since the surrounding solvent molecules are forced to hydrogen bond with each other, forming
cage-like clathrate structures. The resultant decrease in entropy is highly unfavorable.
Lipophilic atoms may be sulphurs which are neither polar nor hydrogen acceptors and carbon
atoms which are not polar.

Van der Waals bonds are non-specific forces found between atoms which are 3-4 Å apart.
They are weaker and less specific than hydrogen and electrostatic bonds. The distribution of
electronic charge around an atom changes with time and, at any instant, the charge distribution
is not symmetric. This transient asymmetry in electronic charge induces a similar asymmetry
in neighboring atoms. The resultant attractive forces between atoms reach a maximum at
the van der Waals contact distance but diminish very rapidly at about 1 Å to about 2 Å.
Conversely, as atoms become separated by less than the contact distance, increasingly strong
repulsive forces become dominant as the outer electron clouds of the atoms overlap. Although
the attractive forces are relatively weak compared to electrostatic and hydrogen bonds (about
0.6 kcal/mol), the repulsive forces in particular may be very important in determining whether
a peptide ligand may bind successfully to a protein.

In one embodiment, the Böhm scoring function (SCORE1 approach) is used to estimate the
binding constant (Böhm, H.J., J. Comput. Aided Mol. Des., 8(3):243-256 (1994), which is
hereby incorporated in its entirety). In another embodiment, the scoring function (SCORE2
approach) is used to estimate the binding affinities as an indicator of a ligand containing a
T-cell epitope (Böhm, H.J., J. Comput. Aided Mol. Des., 12(4):309-323 (1998), which is hereby
incorporated in its entirety). However, the Böhm scoring functions as described in the above
references are used to estimate the binding affinity of a ligand to a protein where it is already
known that the ligand successfully binds to the protein and the protein/ligand complex has had
its structure solved, the solved structure being present in the Protein Data Bank ("PDB").
Therefore, the scoring function has been developed with the benefit of known positive binding
data. In order to allow for discrimination between positive and negative binders, a repulsion
term must be added to the equation. In addition, a more satisfactory estimate of binding
energy is achieved by computing the lipophilic interactions in a pairwise manner rather than
using the area based energy term of the above Böhm functions. Therefore, in a preferred
embodiment, the binding energy is estimated using a modified Böhm scoring function. In the
modified Böhm scoring function, the binding energy between protein and ligand ($\Delta G_{bind}$)
is estimated considering the following parameters: the reduction of binding energy due to the
overall loss of translational and rotational entropy of the ligand ($\Delta G_0$); contributions from
ideal hydrogen bonds ($\Delta G_{hb}$) where at least one partner is neutral; contributions from
unperturbed ionic interactions ($\Delta G_{ionic}$); lipophilic interactions between lipophilic ligand
atoms and lipophilic acceptor atoms ($\Delta G_{lipo}$); the loss of binding energy due to the freezing
of internal degrees of freedom in the ligand, i.e., the freedom of rotation about each C-C bond is
reduced ($\Delta G_{rot}$); the energy of the interaction between the protein and ligand ($E_{VdW}$).
Consideration of these terms gives equation 1:

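$$\Delta G_{bind} = \Delta G_0 + \Delta G_{hb} N_{hb} + \Delta G_{ionic} N_{ionic} + \Delta G_{lipo} N_{lipo} + \Delta G_{rot} N_{rot} + E_{VdW}$$

where N is the number of qualifying interactions for a specific term and, in one embodiment,
$\Delta G_0$, $\Delta G_{hb}$, $\Delta G_{ionic}$, $\Delta G_{lipo}$ and $\Delta G_{rot}$ are constants which are given the
values 5.4, -4.7, -4.7, -0.17, and 1.4, respectively.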
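
As a rough illustration, and assuming that each constant simply multiplies its interaction
count and that the terms are summed (the form of Böhm's published functions), the overall
estimate could be assembled as below. The function and argument names are hypothetical; the
five constants are the values quoted above.

```python
# Constants quoted in the text for one embodiment.
DG_0 = 5.4
DG_HB = -4.7
DG_IONIC = -4.7
DG_LIPO = -0.17
DG_ROT = 1.4

def delta_g_bind(n_hb, n_ionic, n_lipo, n_rot, e_vdw):
    """Weighted sum of the interaction counts plus the van der Waals term.

    The additive form is an assumption based on the list of terms in the text.
    """
    return (
        DG_0
        + DG_HB * n_hb
        + DG_IONIC * n_ionic
        + DG_LIPO * n_lipo
        + DG_ROT * n_rot
        + e_vdw
    )
```

With no interactions at all the function returns the entropy penalty alone (5.4), so any
favourable binding estimate has to be earned by the hydrogen-bond, ionic and lipophilic terms.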

The term $N_{hb}$ is calculated according to equation 2:

$$N_{hb} = \sum_{h\text{-}bonds} f(\Delta R, \Delta\alpha) \times f(N_{neighb}) \times f_{pcs}$$

$f(\Delta R, \Delta\alpha)$ is a penalty function which accounts for large deviations of hydrogen
bonds from ideality and is calculated according to equation 3:

$$f(\Delta R, \Delta\alpha) = f_1(\Delta R) \times f_2(\Delta\alpha)$$

where:

$$f_1(\Delta R) = \begin{cases} 1 & \text{if } \Delta R \le TOL \\ 1 - (\Delta R - TOL)/0.4 & \text{if } \Delta R \le 0.4 + TOL \\ 0 & \text{if } \Delta R > 0.4 + TOL \end{cases}$$

and:

$$f_2(\Delta\alpha) = \begin{cases} 1 & \text{if } \Delta\alpha < 30^\circ \\ 1 - (\Delta\alpha - 30)/50 & \text{if } \Delta\alpha \le 80^\circ \\ 0 & \text{if } \Delta\alpha > 80^\circ \end{cases}$$

TOL is the tolerated deviation in hydrogen bond length = 0.25 Å
$\Delta R$ is the deviation of the H-O/N hydrogen bond length from the ideal value = 1.9 Å
$\Delta\alpha$ is the deviation of the hydrogen bond angle $\angle_{N/O-H..O/N}$ from its idealized value of 180°
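
The two piecewise penalties of equation 3 translate directly into code. The sketch below uses
the constants given above (TOL = 0.25 Å, ideal length 1.9 Å, ideal angle 180°); the function
names and the use of absolute deviations are assumptions made for illustration.

```python
TOL = 0.25           # tolerated deviation in hydrogen bond length, in angstroms
IDEAL_LENGTH = 1.9   # ideal H...O/N distance, in angstroms
IDEAL_ANGLE = 180.0  # ideal hydrogen bond angle, in degrees

def f1(delta_r):
    """Distance penalty: 1 within tolerance, linear fall-off over 0.4 A, then 0."""
    if delta_r <= TOL:
        return 1.0
    if delta_r <= 0.4 + TOL:
        return 1.0 - (delta_r - TOL) / 0.4
    return 0.0

def f2(delta_alpha):
    """Angle penalty: 1 below 30 degrees, linear fall-off to 80 degrees, then 0."""
    if delta_alpha < 30.0:
        return 1.0
    if delta_alpha <= 80.0:
        return 1.0 - (delta_alpha - 30.0) / 50.0
    return 0.0

def hbond_penalty(length, angle):
    """f(dR, dAlpha) for one hydrogen bond, from its measured length and angle."""
    return f1(abs(length - IDEAL_LENGTH)) * f2(abs(IDEAL_ANGLE - angle))
```

An ideal bond (1.9 Å, 180°) scores 1, and the score falls to 0 once the length deviates by more
than 0.65 Å or the angle by more than 80°.
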
$f(N_{neighb})$ distinguishes between concave and convex parts of a protein surface and therefore
assigns greater weight to polar interactions found in pockets rather than those found at the
protein surface. This function is calculated according to equation 4 below:

$$f(N_{neighb}) = (N_{neighb}/N_{neighb,0})^{\alpha} \quad \text{where } \alpha = 0.5$$

$N_{neighb}$ is the number of non-hydrogen protein atoms that are closer than 5 Å to any given
protein atom.
$N_{neighb,0}$ is a constant = 25

$f_{pcs}$ is a function which allows for the polar contact surface area per hydrogen bond and
therefore distinguishes between strong and weak hydrogen bonds and its value is determined
according to the following criteria:

$$f_{pcs} = \beta \quad \text{when } A_{polar}/N_{HB} < 10\ \text{Å}^2$$
$$f_{pcs} = 1 \quad \text{when } A_{polar}/N_{HB} > 10\ \text{Å}^2$$

$A_{polar}$ is the size of the polar protein-ligand contact surface
$N_{HB}$ is the number of hydrogen bonds
$\beta$ is a constant whose value = 1.2
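
The remaining factors of equation 2 can be sketched in the same way. The symbol for the
constant 1.2 is read here as β, which is an assumption because the character is not legible in
the source; the behaviour at exactly 10 Å² is not specified in the text and is set to 1 in this
sketch. Function names and the input layout of `n_hb` are hypothetical.

```python
def f_neighb(n_neighb, n_neighb_0=25, alpha=0.5):
    """Burial weighting: above 1 in pockets (many neighbours), below 1 at the surface."""
    return (n_neighb / n_neighb_0) ** alpha

def f_pcs(a_polar, n_hbonds, beta=1.2):
    """Polar-contact-surface factor per hydrogen bond.

    Returns beta when the polar contact area per hydrogen bond is below
    10 square angstroms, otherwise 1 (the boundary case is an assumption).
    """
    return beta if a_polar / n_hbonds < 10.0 else 1.0

def n_hb(bonds, a_polar):
    """Equation 2: sum over hydrogen bonds of penalty x burial x f_pcs.

    bonds is a list of (geometric penalty, neighbour count) pairs, where the
    geometric penalty is the value of f(dR, dAlpha) for that bond.
    """
    if not bonds:
        return 0.0
    pcs = f_pcs(a_polar, len(bonds))
    return sum(penalty * f_neighb(n) * pcs for penalty, n in bonds)
```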

For the implementation of the modified Böhm scoring function, the contributions from ionic
interactions, $\Delta G_{ionic}$, are computed in a similar fashion to those from hydrogen bonds
described above since the same geometry dependency is assumed.
The term $N_{lipo}$ is calculated according to equation 5 below:

$$N_{lipo} = \sum_{lL} f(r_{lL})$$

$f(r_{lL})$ is calculated for all lipophilic ligand atoms, l, and all lipophilic protein atoms, L,
according to the following criteria:

$$f(r_{lL}) = 1 \quad \text{when } r_{lL} \le R1$$
$$f(r_{lL}) = (r_{lL} - R1)/(R2 - R1) \quad \text{when } R1 < r_{lL} < R2$$
$$f(r_{lL}) = 0 \quad \text{when } r_{lL} \ge R2$$

where: $R1 = r_l^{vdw} + r_L^{vdw} + 0.5$
and $R2 = R1 + 3.0$
and $r_l^{vdw}$ is the van der Waals radius of atom l
and $r_L^{vdw}$ is the van der Waals radius of atom L
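
A pairwise lipophilic count along the lines of equation 5 might look like the sketch below.
One point needs flagging: the middle branch as printed rises from 0 at R1 to 1 at R2, which
would make the function jump at both ends of the interval. The sketch therefore assumes the
decreasing ramp 1 - (r - R1)/(R2 - R1), which joins the two outer branches continuously; this
is an assumption of the example, not a statement of what the specification intends. The
radii are the van der Waals values quoted in the text for equations 5 and 6.

```python
# Van der Waals radii in angstroms, as quoted in the text.
VDW_RADIUS = {"C": 1.85, "N": 1.75, "O": 1.60, "S": 2.00}

def f_lipo(r, ligand_element, protein_element):
    """Contact weight for one lipophilic ligand/protein atom pair at distance r."""
    r1 = VDW_RADIUS[ligand_element] + VDW_RADIUS[protein_element] + 0.5
    r2 = r1 + 3.0
    if r <= r1:
        return 1.0
    if r >= r2:
        return 0.0
    # Assumed decreasing ramp between R1 and R2 (see the note above).
    return 1.0 - (r - r1) / (r2 - r1)

def n_lipo(pairs):
    """Sum of contact weights over (distance, ligand element, protein element) pairs."""
    return sum(f_lipo(r, a, b) for r, a, b in pairs)
```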

The term $N_{rot}$ is the number of rotatable bonds of the amino acid side chain and is taken to
be the number of acyclic sp3-sp3 and sp3-sp2 bonds. Rotations of terminal -CH3 or -NH3 are not
taken into account.
The final term, $E_{VdW}$, is calculated according to equation 6 below:

$$E_{VdW} = \varepsilon_1 \varepsilon_2 \left( \frac{(r_1^{vdw} + r_2^{vdw})^{12}}{r^{12}} - \frac{(r_1^{vdw} + r_2^{vdw})^{6}}{r^{6}} \right)$$

where:
$\varepsilon_1$ and $\varepsilon_2$ are constants dependent upon atom identity
$r_1^{vdw} + r_2^{vdw}$ are the van der Waals atomic radii
r is the distance between a pair of atoms.
With regard to equation 6, in one embodiment, the constants $\varepsilon_1$ and $\varepsilon_2$ are given the
atom values: C: 0.245, N: 0.283, O: 0.316, S: 0.316, respectively (i.e. for atoms of Carbon,
Nitrogen, Oxygen and Sulphur, respectively). With regard to equations 5 and 6, the van der
Waals radii are given the atom values C: 1.85, N: 1.75, O: 1.60, S: 2.00 Å.

It should be understood that all predetermined values and constants given in the equations
above are determined within the constraints of current understandings of protein ligand
interactions with particular regard to the type of computation being undertaken herein.
Therefore, it is possible that, as this scoring function is refined further, these values and
constants may change, hence any suitable numerical value which gives the desired results in
terms of estimating the binding energy of a protein to a ligand may be used and hence fall
within the scope of the present invention.
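
Equation 6 has the familiar 12-6 form and can be evaluated per atom pair as in the sketch
below, using the ε constants and van der Waals radii quoted above. The function name and the
dictionary layout are illustrative assumptions.

```python
# Per-element constants quoted in the text for equation 6.
EPSILON = {"C": 0.245, "N": 0.283, "O": 0.316, "S": 0.316}
VDW_RADIUS = {"C": 1.85, "N": 1.75, "O": 1.60, "S": 2.00}  # angstroms

def e_vdw_pair(element_1, element_2, r):
    """12-6 interaction energy for one atom pair separated by r angstroms."""
    contact = VDW_RADIUS[element_1] + VDW_RADIUS[element_2]
    x6 = (contact / r) ** 6
    return EPSILON[element_1] * EPSILON[element_2] * (x6 * x6 - x6)
```

The term is zero when r equals the sum of the two radii, rises steeply at shorter distances
(the repulsion needed to discriminate non-binders), and is mildly negative just beyond contact.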

As described above, the scoring function is applied to data extracted from the database of
side-chain conformations, atom identities, and interatomic distances. For the purposes of the
present description, the number of MHC Class II molecules included in this database is 42
models plus four solved structures. It should be apparent from the above descriptions that the
modular nature of the construction of the computational method of the present invention
means that new models can simply be added and scanned with the peptide backbone library
and side-chain conformational search function to create additional data sets which can be
processed by the peptide scoring function as described above. This allows for the repertoire of
scanned MHC Class II molecules to easily be increased, or structures and associated data to be
replaced if data are available to create more accurate models of the existing alleles.

The present prediction method can be calibrated against a data set comprising a large number
of peptides whose affinity for various MHC Class II molecules has previously been
experimentally determined. By comparison of calculated versus experimental data, a cut-off
value can be determined above which it is known that all experimentally determined T-cell
epitopes are correctly predicted.
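
One simple way to read the calibration step is to take the lowest calculated score among the
experimentally confirmed epitopes as the cut-off, so that every known epitope is predicted.
The sketch below does exactly that; the data layout and function names are hypothetical, and
treating scores at or above the cut-off as positive is an assumption.

```python
def calibration_cutoff(scored_peptides):
    """Lowest calculated score among experimentally confirmed epitopes.

    scored_peptides is an iterable of (calculated score, is_known_epitope).
    """
    return min(score for score, is_epitope in scored_peptides if is_epitope)

def false_positives(scored_peptides, cutoff):
    """Number of non-epitopes that would still be flagged at this cut-off."""
    return sum(
        1 for score, is_epitope in scored_peptides
        if not is_epitope and score >= cutoff
    )
```

Counting the false positives at the chosen cut-off gives a quick view of the price paid for
capturing every known epitope.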

It should be understood that, although the above scoring function is relatively simple
compared to some sophisticated methodologies that are available, the calculations are
performed extremely rapidly. It should also be understood that the objective is not to calculate
the true binding energy per se for each peptide docked in the binding groove of a selected
MHC Class II protein. The underlying objective is to obtain comparative binding energy data
as an aid to predicting the location of T-cell epitopes based on the primary structure (i.e.
amino acid sequence) of a selected protein. A relatively high binding energy or a binding
energy above a selected threshold value would suggest the presence of a T-cell epitope in the
ligand. The ligand may then be subjected to at least one round of amino-acid substitution and
the binding energy recalculated. Due to the rapid nature of the calculations, these
manipulations of the peptide sequence can be performed interactively within the program's
user interface on cost-effectively available computer hardware. Major investment in computer
hardware is thus not required.

It would be apparent to one skilled in the art that other available software could be used for the
same purposes. In particular, more sophisticated software which is capable of docking ligands
into protein binding-sites may be used in conjunction with energy minimization. Examples of
docking software are: DOCK (Kuntz et al., J. Mol. Biol., 161:269-288 (1982)), LUDI (Böhm,
H.J., J. Comput. Aided Mol. Des., 8:623-632 (1994)) and FLEXX (Rarey M., et al., ISMB,
3:300-308 (1995)). Examples of molecular modeling and manipulation software include:
AMBER (Tripos) and CHARMm (Molecular Simulations Inc.). The use of these
computational methods would severely limit the throughput of the method of this invention
due to the lengths of processing time required to make the necessary calculations. However, it
is feasible that such methods could be used as a 'secondary screen' to obtain more accurate
calculations of binding energy for peptides which are found to be 'positive binders' via the
method of the present invention. The limitation of processing time for sophisticated molecular
mechanics or molecular dynamics calculations is one which is defined both by the design of the
software which makes these calculations and the current technology limitations of computer
hardware. It may be anticipated that, in the future, with the writing of more efficient code and
the continuing increases in speed of computer processors, it may become feasible to make
such calculations within a more manageable time-frame. Further information on energy
functions applied to macromolecules and consideration of the various interactions that take
place within a folded protein structure can be found in: Brooks, B.R., et al., J. Comput. Chem.,
4:187-217 (1983) and further information concerning general protein-ligand interactions can
be found in: Dauber-Osguthorpe et al., Proteins 4(1):31-47 (1988), which are incorporated
herein by reference in their entirety. Useful background information can also be found, for
example, in Fasman, G.D., ed., Prediction of Protein Structure and the Principles of Protein
Conformation, Plenum Press, New York, ISBN: 0-306 4313-9.

EXAMPLE 2: De-immunized forms of Fc-Leptin

Leptin is a secreted signaling protein of 146 amino acid residues involved in the homeostatic
mechanisms maintaining adipose mass (e.g. WO 00/40615, WO 98/28427, WO 96/05309).
The protein (and its antagonists) offers significant therapeutic potential for the treatment of
diabetes, high blood pressure and cholesterol metabolism.
Fc-leptin is a fusion protein for which the serum half-life is profoundly improved compared to
leptin itself (WO 00/40615). However, certain forms of Fc-leptin, such as when the Fc is
derived from human IgG1 or human IgG3, have the potential to show enhanced
immunogenicity under certain circumstances, such as administration by subcutaneous
injection. In a Phase I clinical trial, leptin alone was found to be at least somewhat
immunogenic. The invention discloses sequences identified within the leptin primary sequence
that are potential T-cell epitopes by virtue of MHC class II binding potential. This disclosure
specifically pertains to the human leptin moiety containing about 146 amino acid residues.
Others have provided modified leptin (US 5,900,404; WO 96/05309) but these approaches
have been directed towards improvements in the commercial production of leptin, for example
improved in vitro stability. Such teachings do not recognize the importance of T-cell epitopes
to the immunogenic properties of the protein, nor have they been conceived to directly influence
said properties in a specific and controlled way according to the scheme of the present
invention. Specific Fc-leptin forms: Fcγ1-leptin, Fcγ2-leptin, both forms preferably with
linker peptide and optionally modified Fc domain having reduced affinity to Fc-receptors.
Sequences to be modified in leptin are shown below:

An amino acid sequence which is part of the sequence of an immunogenically non-modified 
human obesity protein (leptin) and has a potential MHC class II binding activity is selected 
from the following group identified according to the method of the invention: 
Peptide sequences in human leptin with potential human MHC class II binding activity:

VPIQKVQDDTKTL, QKVQDDTKTLIKT, KTLIKTIVTRIND, TLIKTIVTRINDI,
KTIVTRINDISHT, TIVTRINDISHTQ, TRINDISHTQSVS, NDISHTQSVSSKQ,
QSVSSKQKVTGLD, SSKQKVTGLDFIP, QKVTGLDFIPGLH, TGLDFIPGLHPIL,
LDFIPGLHPILTL, DFIPGLHPILTLS, PGLHPILTLSKMD, GLHPILTLSKMDQ,
HPILTLSKMDQTL, PILTLSKMDQTLA, LTLSKMDQTLAVY, SKMDQTLAVYQQI,
QTLAVYQQILTSM, LAVYQQILTSMPS, AVYQQILTSMPSR, QQILTSMPSRNVI,
QILTSMPSRNVIQ, TSMPSRNVIQISN, SRNVIQISNDLEN, RNVIQISNDLENL,
NVIQISNDLENLR, IQISNDLENLRDL, NDLENLRDLLHVL, LENLRDLLHVLAF,
ENLRDLLHVLAFS, RDLLHVLAFSKSC, DLLHVLAFSKSCH, LHVLAFSKSCHLP,
HVLAFSKSCHLPW, LAFSKSCHLPWAS, CHLPWASGLETLD, SGLETLDSLGGVL,
DSLGGVLEASGYS, SLGGVLEASGYST, GGVLEASGYSTEV, SGYSTEVVALSRL,
TEVVALSRLQGSL, EVVALSRLQGSLQ, VALSRLQGSLQDM, SRLQGSLQDMLWQ,
QGSLQDMLWQLDL, GSLQDMLWQLDLS, QDMLWQLDLSPGC

Substitutions leading to the elimination of potential T-cell epitopes of human leptin (WT =
wild type).
Any of the above-cited peptide sequences can be used for modifying by exchanging one or
more amino acids to obtain a sequence having a reduced or no immunogenicity.

EXAMPLE 3: De-immunized forms of Fc-IL-1Ra

The present invention provides for modified forms of an interleukin-1 receptor antagonist
(IL-1Ra) with one or more T-cell epitopes removed. IL-1 is an important inflammatory and
immune modulating cytokine with pleiotropic effects on a variety of tissues but may
contribute to the pathology associated with rheumatoid arthritis and other diseases associated
with local tissue damage. An IL-1 receptor antagonist able to inhibit the action of IL-1 has
been purified and the gene cloned [Eisenberg S.P. et al (1990) Nature, 343: 341-346; Carter,
D.B. et al (1990) Nature, 344: 633-637]. Others have provided IL-1Ra molecules [e.g. US
5,075,222]. Recombinant forms of this protein have therapeutic potential in disease settings
where the effects of IL-1 are deleterious. However, there remains a continued need for IL-1Ra
analogues with enhanced properties. Desired enhancements include alternative schemes and
modalities for the expression and purification of the said therapeutic, but also and especially,
improvements in the biological properties of the protein. There is a particular need for
enhancement of the in vivo characteristics when administered to the human subject. In this
regard, it is highly desired to provide IL-1Ra with reduced or absent potential to induce an
immune response in the human subject. Such proteins would be expected to display an increased
circulation time within the human subject and would be of particular benefit in chronic or
recurring disease settings such as is the case for a number of indications for IL-1Ra. The
present invention provides for modified forms of IL-1Ra proteins that are expected to display
enhanced properties in vivo. This disclosure specifically pertains to a human IL-1Ra protein
being of 152 amino acid residues (Eisenberg, S.P. et al (1991) Proc. Natl. Acad. Sci. U.S.A.
88: 5232-5236).

Specific Fc-IL-1Ra forms: Fcγ1-IL-1Ra, Fcγ2-IL-1Ra, both forms preferably with linker
peptide and optionally modified Fc domain having reduced affinity to Fc-receptors.
Peptide sequences in human interleukin-1 receptor antagonist (IL-1RA) with potential human
MHC class II binding activity:

RKSSKMQAFRIWD, SKMQAFRIWDVNQ, QAFRIWDVNQKTF, FRIWDVNQKTFYL,
RIWDVNQKTFYLR, IWDVNQKTFYLRN, WDVNQKTFYLRNN, KTFYLRNNQLVAG,
TFYLRNNQLVAGY, FYLRNNQLVAGYL, LRNNQLVAGYLQG, RNNQLVAGYLQGP,
NQLVAGYLQGPNV, QLVAGYLQGPNVN, LVAGYLQGPNVNL, AGYLQGPNVNLEE,
GYLQGPNVNLEEK, PNVNLEEKIDVVP, VNLEEKIDVVPIE, EKIDVVPIEPHAL,
IDVVPIEPHALFL, DVVPIEPHALFLG, VPIEPHALFLGIH, HALFLGIHGGKMC,
ALFLGIHGGKMCL, LFLGIHGGKMCLS, LGIHGGKMCLSCV, GKMCLSCVKSGDE,
MCLSCVKSGDETR, SCVKSGDETRLQL, ETRLQLEAVNITD, TRLQLEAVNITDL,
LQLEAVNITDLSE, EAVNITDLSENRK, VNITDLSENRKQD, TDLSENRKQDKRF,
ENRKQDKRFAFIR, KRFAFIRSDSGPT, FAFIRSDSGPTTS, AFIRSDSGPTTSF,
TSFESAACPGWFL, SFESAACPGWFLC, PGWFLCTAMEADQ, WFLCTAMEADQPV,
TAMEADQPVSLTN, QPVSLTNMPDEGV, VSLTNMPDEGVMV, TNMPDEGVMVTKF,
PDEGVMVTKFYFQ, EGVMVTKFYFQED, GVMVTKFYFQEDE

Substitutions leading to the elimination of potential T-cell epitopes of human interleukin-1
receptor antagonist (IL-1RA) (WT = wild type).

EXAMPLE 4: De-immunized forms of Fc-BDNF

The present invention provides for modified forms of human brain-derived neurotrophic factor
(BDNF) with one or more T-cell epitopes removed. BDNF is a glycoprotein of the nerve
growth factor family of proteins. The mature 119 amino acid glycoprotein is processed from a
larger precursor to yield a neurotrophic factor that promotes the survival of neuronal cell
populations [Jones K.R. & Reichardt, L.F. (1990) Proc. Natl. Acad. Sci. U.S.A. 82: 8060-
8064]. Such neuronal cells are all located either in the central nervous system or directly
connected to it. Others have provided modified BDNF molecules [US 5,770,577] and approaches
towards the commercial production of recombinant BDNF molecules [US 5,986,070].
Recombinant preparations of BDNF have enabled the therapeutic potential of the protein to be
explored for the promotion of nerve regeneration and degenerative disease therapy.
Specific Fc-BDNF forms: Fcγ1-BDNF, Fcγ2-BDNF, both forms preferably with linker
peptide and optionally modified Fc domain having reduced affinity to Fc-receptors.
Peptide sequences in human brain-derived neurotrophic factor (BDNF) with potential human
MHC class II binding activity:

LSVCDSISEWVTA, DSISEWVTAADKK, SEWVTAADKKTAV, EWVTAADKKTAVD,
WVTAADKKTAVDM, KTAVDMSGGTVTV, TAVDMSGGTVTVL, VDMSGGTVTVLEK,
GTVTVLEKVPVSK, VTVLEKVPVSKGQ, TVLEKVPVSKGQL, EKVPVSKGQLKQY,
VPVSKGQLKQYFY, GQLKQYFYETKCN, KQYFYETKCNPMG, QYFYETKCNPMGY,
YFYETKCNPMGYT, NPMGYTKEGCRGI, MGYTKEGCRGIDK, RGIDKRHWNSQCR,
RHWNSQCRTTQSY, HWNSQCRTTQSYV, QSYVRALTMDSKK, SYVRALTMDSKKR,
RALTMDSKKRIGW, LTMDSKKRIGWRF, KRIGWRFIRIDTS, IGWRFIRIDTSCV,
GWRFIRIDTSCVC, WRFIRIDTSCVCT, RFIRIDTSCVCTL, IRIDTSCVCTLTI,
IDTSCVCTLTIKR

Substitutions leading to the elimination of potential T-cell epitopes of human brain-derived
neurotrophic factor (BDNF) (WT = wild type).

EXAMPLE 5: De-immunized forms of Fc-EPO

The present invention provides for modified forms of human erythropoietin (EPO) with one or
more T-cell epitopes removed. EPO is a 165 amino acid residue glycoprotein hormone
involved in the maturation of erythroid progenitor cells into erythrocytes. Naturally occurring 
EPO is produced by the liver during foetal life and by the kidney of adults and circulates in the 
blood to stimulate production of red blood cells in bone marrow. Anaemia is almost 
invariably a consequence of renal failure due to decreased production of EPO from the kidney. 
Recombinant EPO is used as an effective treatment of anaemia resulting from chronic renal 
failure. 

Recombinant EPO (expressed in mammalian cells) having the amino acid sequence 1-165 of 
human erythropoietin [Jacobs, K. et al (1985) Nature, 313: 806-810; Lin, F.-K. et al (1985) 
Proc. Natl. Acad. Sci. U.S.A. 82:7580-7585] contains three N-linked and one O-linked 
oligosaccharide chains each containing terminal sialic acid residues. The latter are significant 
in enabling EPO to evade rapid clearance from the circulation by the hepatic 
asialoglycoprotein binding protein. 

Non-de-immunized Fc-EPO is known e.g. from WO 99/58662, WO 99/02709. 

Specific Fc-EPO forms: Fcγ1-EPO, Fcγ2-EPO, both forms preferably with linker peptide
and optionally modified Fc domain having reduced affinity to Fc-receptors. The EPO may be
glycosylated, partially glycosylated or have a modified glycosylation pattern.

Peptide sequences in human erythropoietin (EPO) with potential human MHC class II binding
activity:

PRLICDSRVLERY, RLICDSRVLERYL, ICDSRVLERYLLE, CDSRVLERYLLEA,
SRVLERYLLEAKE, RVLERYLLEAKEA, LERYLLEAKEAEN, ERYLLEAKEAENI,
RYLLEAKEAENIT, YLLEAKEAENITT, LEAKEAENITTGC, KEAENITTGCAEH,
ENITTGCAEHCSL, CSLNENITVPDTK, NENITVPDTKVNF, ENITVPDTKVNFY,
NITVPDTKVNFYA, ITVPDTKVNFYAW, TKVNFYAWKRMEV, VNFYAWKRMEVGQ,
NFYAWKRMEVGQQ, YAWKRMEVGQQAV, KRMEVGQQAVEVW, RMEVGQQAVEVWQ,
MEVGQQAVEVWQG, QAVEVWQGLALLS, AVEVWQGLALLSE, VEVWQGLALLSEA,
EVWQGLALLSEAV, VWQGLALLSEAVL, WQGLALLSEAVLR, QGLALLSEAVLRG,
LALLSEAVLRGQA, ALLSEAVLRGQAL, LSEAVLRGQALLV, SEAVLRGQALLVN,
EAVLRGQALLVNS, AVLRGQALLVNSS, QALLVNSSQPWEP, ALLVNSSQPWEPL,
LLVNSSQPWEPLQ, QPWEPLQLHVDKA, EPLQLHVDKAVSG, LQLHVDKAVSGLR,
LHVDKAVSGLRSL, KAVSGLRSLTTLL, SGLRSLTTLLRAL, RSLTTLLRALGAQ,
SLTTLLRALGAQK, TTLLRALGAQKEA, TLLRALGAQKEAI, RALGAQKEAISPP,
AQKEAISPPDAAS, EAISPPDAASAAP, SPPDAASAAPLRT, ASAAPLRTITADT,
APLRTITADTFRK, RTITADTFRKLFR, TITADTFRKLFRV, DTFRKLFRVYSNF,
RKLFRVYSNFLRG, KLFRVYSNFLRGK, FRVYSNFLRGKLK, RVYSNFLRGKLKL,
YSNFLRGKLKLYT, SNFLRGKLKLYTG, NFLRGKLKLYTGE, RGKLKLYTGEACR,
GKLKLYTGEACRT, LKLYTGEACRTGD, KLYTGEACRTGDR

Substitutions leading to the elimination of potential T-cell epitopes of human erythropoietin
(EPO) (WT = wild type).
EXAMPLE 6: De-immunized forms of G-CSF

The present invention provides for modified forms of human granulocyte colony stimulating 
factor (G-CSF) with one or more T-cell epitopes removed. G-CSF is an important 
haemopoietic cytokine currently used in treatment of indications where an increase in blood 
neutrophils will provide benefits. These include cancer therapy, various infectious diseases 
and related conditions such as sepsis. G-CSF is also used alone, or in combination with other 
compounds and cytokines in the ex vivo expansion of haemopoietic cells for bone marrow 
transplantation. 

Two forms of human G-CSF are commonly recognized. One is a protein of
177 amino acids, the other a protein of 174 amino acids [Nagata et al. (1986), EMBO J. 5:
575-581]; the 174 amino acid form has been found to have the greatest specific in vivo
biological activity. Recombinant DNA techniques have enabled the production of commercial 
scale quantities of G-CSF exploiting both eukaryotic and prokaryotic host cell expression 
systems. This disclosure specifically pertains to both recognized forms of the human G-CSF 
protein being the 177 amino acid species and the 174 amino acid species. 
Other polypeptide analogues and peptide fragments of G-CSF have been previously disclosed,
including forms modified by site-specific amino acid substitutions and/or by modification by
chemical adducts. Thus US 4,810,643 discloses analogues with particular Cys residues
replaced with another amino acid, and G-CSF with an Ala residue in the first (N-terminal) 
position. EP 0 335 423 discloses the modification of at least one amino group in a polypeptide 
having G-CSF activity. EP 0 272 703 discloses G-CSF derivatives having amino acid 
substituted or deleted near the N-terminus. EP 0 459 630 discloses G-CSF derivatives in 
which Cys 17 and Asp 27 are replaced by Ser residues. EP 0 243 153 discloses G-CSF 
modified by inactivating at least one yeast KEX2 protease processing site for increased yield 
in recombinant production and US 4,904,584 discloses lysine altered proteins. WO 90/12874 
discloses further Cys altered variants and Australian patent document AU 10948/92 discloses 
the addition of amino acids to either terminus of a G-CSF molecule for the purpose of aiding 
in the folding of the molecule after prokaryotic expression. A further Australian document,
AU 76380/91, discloses G-CSF variants at positions 50-56 of the G-CSF 174 amino acid form, 
and positions 53-59 of the 177 amino acid form. Additional changes at particular His residues 
were also disclosed. 

Non-deimmunized Fc-G-CSF is known e.g. from WO 99/58662. Specific Fc-G-CSF forms:
Fcγ1-G-CSF, Fcγ2-G-CSF, both forms preferably with linker peptide and optionally
modified Fc domain having reduced affinity to Fc-receptors.

Peptide sequences in human granulocyte colony stimulating factor (G-CSF) with potential 
human MHC class II binding activity. 
SSLPQSFLLKCLE, QSFLLKCLEQVRK, SFLLKCLEQVRKI,
KCLEQVRKIQGDG, EQVRKIQGDGAAL, RKIQGDGAALQEK,
EKLVSECATYKLC, KLVSECATYKLCH, AALQEKLCATYKL,
ATYKLCHPEELVL, YKLCHPEELVLLG, EELVLLGHSLGIP,
HSLGIPWAPLSSC, IPWAPLSSCPSQA, APLSSCPSQALQL,
GCLSQLHSGLFLY, SQLHSGLFLYQGL, SGLFLYQGLLQAL,
LFLYQGLLQALEG, FLYQGLLQALEGI, QGLLQALEGISPE, GLLQALEGISPEL,
QALEGISPELGPT, EGISPELGPTLDT, PTLDTLQLDVADF, DTLQLDVADFATT,
LQLDVADFATTIW, LDVADFATTIWQQ, TTIWQQMEELGMA, TIWQQMEELGMAP,
QQMEELGMAPALQ, EELGMAPALQPTQ, LGMAPALQPTQGA, PALQPTQGAMPAF,
GAMPAFASAFQRR, PAFASAFQRRAGG, SAFQRRAGGVLVA, GGVLVASHLQSFL,
GVLVASHLQSFLE, VLVASHLQSFLEV, SHLQSFLEVSYRV, QSFLEVSYRVLRH,
SFLEVSYRVLRHL, LEVSYRVLRHLAQ
Substitutions leading to the elimination of potential T-cell epitopes of human granulocyte
colony stimulating factor (G-CSF) (WT = wild type).

EXAMPLE 7: De-immunized forms of KGF

The present invention provides for modified forms of human keratinocyte growth factor
(KGF) with one or more T-cell epitopes removed. KGF is a member of the fibroblast growth
factor (FGF) / heparin-binding growth factor family of proteins. It is a secreted glycoprotein
expressed predominantly in the lung, promoting wound healing by stimulating the growth of
keratinocytes and other epithelial cells [Finch et al (1989), Science 24: 752-755; Rubin et al
(1989), Proc. Natl. Acad. Sci. U.S.A. 86: 802-806]. The mature (processed) form of the
glycoprotein comprises 163 amino acid residues and may be isolated from conditioned media
following culture of particular cell lines [Rubin et al (1989) ibid.], or produced using
recombinant techniques [Ron et al (1993) J. Biol. Chem. 268: 2984-2988]. The protein is of
therapeutic value for the stimulation of epithelial cell growth in a number of significant
disease and injury repair settings. This disclosure specifically pertains to the human KGF protein
being the mature (processed) form of 163 amino acid residues. Others have also provided
KGF molecules [e.g. US 6,008,328; WO 90/08771] including modified KGF [Ron et al
(1993) ibid; WO 95/01434] but

Specific Fc-KGF forms: Fcγ1-KGF, Fcγ2-KGF, both forms preferably with linker peptide
and optionally modified Fc domain having reduced affinity to Fc-receptors.

Peptide sequences in human keratinocyte growth factor (KGF) with potential human MHC
class II binding activity.



NDMTPEQMATNVN, DMTPEQMATNVNC, EQMATNVNCSSPE, TNVNCSSPERHTR,
RSYDYMEGGDIRV, YDYMEGGDIRVRR, DYMEGGDIRVRRL, GDIRVRRLFCRTQ,
IRVRRLFCRTQWY, RRLFCRTQWYLRI, RLFCRTQWYLRID, TQWYLRIDKRGKV,
QWYLRIDKRGKVK, WYLRIDKRGKVKG, LRIDKRGKVKGTQ, GKVKGTQEMKNNY,
QEMKNNYNIMEIR, NNYNIMEIRTVAV, YNIMEIRTVAVGI, NIMEIRTVAVGIV,
MEIRTVAVGIVAI, RTVAVGIVAIKGV, VAVGIVAIKGVES, VGIVAIKGVESEF,
VAIKGVESEFYLA, KGVESEFYLAMNK, SEFYLAMNKEGKL, EFYLAMNKEGKLY,
FYLAMNKEGKLYA, LAMNKEGKLYAKK, GKLYAKKECNEDC, KLYAKKECNEDCN,
CNFKELILENHYN, KELILENHYNTYA, ELILENHYNTYAS, LILENHYNTYASA,
NHYNTYASAKWTH, NTYASAKWTHNGG, AKWTHNGGEMFVA, GEMFVALNQKGIP,
EMFVALNQKGIPV, FVALNQKGIPVRG, VALNQKGIPVRGK, KGIPVRGKKTKKE,
IPVRGKKTKKEQK, KTKKEQKTAHFLP
Substitutions leading to the elimination of potential T-cell epitopes of human keratinocyte
growth factor (KGF) (WT = wild type).

Residue    WT residue    Substitution
EXAMPLE 8: De-immunized sTNF-R(I) and sTNF Inhibitor within corresponding Fc fusions

Fc-sTNF-R(I) and Fc-sTNF Inhibitor are fusion proteins in which the serum half-life is
extended compared to sTNF-R(I) and sTNF Inhibitor themselves. However, certain forms of
Fc-TNF-RI, such as when the Fc is derived from human IgG1 or human IgG3, have the potential
to show enhanced immunogenicity under certain circumstances, such as administration by
subcutaneous injection. The present invention provides for modified forms of a soluble tumor
necrosis factor receptor type I (sTNF-RI) with one or more T-cell epitopes removed.

The sTNF-RI (soluble tumor necrosis factor receptor type I) is a derivative of the human
tumor necrosis factor receptor described previously [Gray, P.W. et al (1990) Proc. Nat. Acad.
Sci. U.S.A. 87: 7380-7384; Loetscher, H. et al (1990) Cell 61: 351-359; Schall, T.J. et al
(1990) Cell 61: 361-370], comprising the extracellular domain of the intact receptor and
exhibiting an approximate molecular weight of 30KDa. Additional soluble TNF inhibitors and
in particular a 40KDa form are also known [US 6,143,866]. The soluble forms are able to
bind tumor necrosis factor alpha with high affinity and inhibit the cytotoxic activity of the
cytokine in vitro. Recombinant preparations of sTNF-RI are of significant therapeutic value
for the treatment of diseases where an excess level of tumor necrosis factor is causing a
pathogenic effect. Indications such as cachexia, sepsis and autoimmune disorders including,
and in particular, rheumatoid arthritis and others may be targeted by such therapeutic
preparations of sTNF-RI. Others including Brewer et al., US 6,143,866, have provided
modified sTNF-RI molecules

Peptide sequences in a human 30KDa sTNF-RI with potential human MHC class II binding
activity:

DSVCPQGKYIHPQ, KYIHPQNNSICCT, NSICCTKCHKGTY, TYLYNDCPGPGQD,
YLYNDCPGPGQDT, NHLRHCLSCSKCR, HCLSCSKCRKEMG, KEMGQVEISSCTV,
GQVEISSCTVDRD, VEISSCTVDRDTV, CTVDRDTVCGCRK, DTVCGCRKNQYRH,
NQYRHYWSENLFQ, RHYWSENLFQCFN, HYWSENLFQCFNC, ENLFQCFNCSLCL,
NLFQCFNCSLCLN, QCFNCSLCLNGTV, CSLCLNGTVHLSC, LCLNGTVHLSCQE,
GTVHLSCQEKQNT, VHLSCQEKQNTVC, EKQNTVCTCHAGF, NTVCTCHAGFFLR,
GFFLRENECVSCS, FFLRENECVSCSN, ECVSCSNCKKSLE, KSLECTKLCLPQI,
TKLCLPQIENVKG, LCLPQIENVKGTE, PQIENVKGTEDSG, SGTTVLLPLVIFF

Peptide sequences in a human 40KDa sTNF inhibitor with potential human MHC class II
binding activity.

TPYAPEPGSTCRL, CRLREYYDQTAQM, REYYDQTAQMCCS, EYYDQTAQMCCSK,
AQMCCSKCSPGQH, KCSPGQHAKVFCT, AKVFCTKTSDTVC, KVFCTKTSDTVCD,
STYTQLWNWVPEC, TQLWNWVPECLSC, QLWNWVPECLSCG, NWVPECLSCGSRC,
ECLSCGSRCSSDQ, SRCSSDQEVTQAC, QEVTQACTREQNR, QNRICTCRPGWYC,
NRICTCRPGWYCA, PGWYCALSKQEGC, GWYCALSKQEGCR, CALSKQEGCRLCA,
APLRKCRPGFGVA, PGFGVARPGTETS, FGVARPGTETSDV, SDVVCKPCAPGTF,
GTFSNTTSSTDIC, TDICRPHQICNVV, HQICNVVAIPGNA, ICNVVAIPGNASR,
CNVVAIPGNASRD, NVVAIPGNASRDA, VAIPGNASRDAVC, DAVCTSTTTPTRS,






WO 02/066514 PCT/EP02/01690 


TRSMAPGAVHLPQ, RSMAPGAVHLPQP, VHLPQPVSTRSQH, QPVSTRSQHTQPT,
PEPSTAPSTSFLL, SFLLPMGPSPPAE, FLLPMGPSPPAEG






EXAMPLE 9 (soluble TNF-R2): 

Fc-sTNF-R2 is a fusion protein in which the serum half-life is extended compared to
sTNF-R2 itself. However, certain forms of Fc-TNF-R2, such as when the Fc is derived from human
IgG1 or human IgG3, have the potential to show enhanced immunogenicity under certain
circumstances, such as administration by subcutaneous injection.

Soluble tumor necrosis factor receptor 2 (sTNF-R2) is a derivative of the human tumor
necrosis factor receptor 2 described previously [Smith, C.A. et al (1990) Science 248:
1019-1023; Kohno, T. et al (1990) Proc. Nat. Acad. Sci. U.S.A. 87: 8331-8335; Beltinger, C.P. et al
(1996) Genomics 35: 94-100] comprising the extracellular domain of the intact receptor. The
soluble forms are able to bind tumour necrosis factor with high affinity and inhibit the
cytotoxic activity of the cytokine in vitro. Recombinant preparations of sTNF-R2 are of
significant therapeutic value for the treatment of diseases where an excess level of tumour
necrosis factor is causing a pathogenic effect. A particular recombinant preparation termed
etanercept has gained clinical approval for the treatment of rheumatoid arthritis, and this and
other similar agents may be of value in the treatment of other indications such as cachexia,
sepsis and autoimmune disorders. Etanercept is a dimeric fusion protein comprising the
extracellular domain of the human TNFR2 molecule in combination with the Fc domain of the
human IgG1 molecule. The dimeric molecule comprises 934 amino acids [US 5,395,760;
US 5,605,690; US 5,945,397].

Peptide sequences in the TNF binding domain of the human TNFR2 protein with potential 
human MHC class II binding activity are: 



TPYAPEPGSTCRL, CRLREYYDQTAQM, REYYDQTAQMCCS, EYYDQTAQMCCSK,
AQMCCSKCSPGQH, KCSPGQHAKVFCT, AKVFCTKTSDTVC, KVFCTKTSDTVCD,
STYTQLWNWVPEC, TQLWNWVPECLSC, QLWNWVPECLSCG, NWVPECLSCGSRC,
ECLSCGSRCSSDQ, SRCSSDQEVTQAC, QEVTQACTREQNR, QNRICTCRPGWYC,
NRICTCRPGWYCA, PGWYCALSKQEGC, GWYCALSKQEGCR, CALSKQEGCRLCA,
APLRKCRPGFGVA, PGFGVARPGTETS, FGVARPGTETSDV, SDVVCKPCAPGTF,
GTFSNTTSSTDIC, TDICRPHQICNVV, HQICNVVAIPGNA, ICNVVAIPGNASR,
CNVVAIPGNASRD, NVVAIPGNASRDA, VAIPGNASRDAVC, DAVCTSTTTPTRS,
TRSMAPGAVHLPQ, RSMAPGAVHLPQP, VHLPQPVSTRSQH, QPVSTRSQHTQPT,
PEPSTAPSTSFLL, SFLLPMGPSPPAE, FLLPMGPSPPAEG
EXAMPLE 10: non-natural forms of Beta-Glucocerebrosidase (B-GCR)

Fc-B-GCR is a fusion protein in which the serum half-life is extended compared to
B-GCR itself. However, certain forms of Fc-B-GCR, such as when the Fc is derived from
human IgG1 or human IgG3, have the potential to show enhanced immunogenicity under
certain circumstances, such as administration by subcutaneous injection. The present invention
provides for modified forms of human GCR, preferably Fc-B-GCR, with one or more T-cell
epitopes removed.

Beta-Glucocerebrosidase (b-D-glucosyl-N-acylsphingosine glucohydrolase, E.C. 3.2.1.45) is a
monomeric glycoprotein of 497 amino acid residues. The enzyme catalyses the hydrolysis of
the glycolipid glucocerebroside to glucose and ceramide. Deficiency in GCR activity results
in a lysosomal storage disease referred to as Gaucher disease. The disease is characterised by
glucocerebroside-engorged tissue macrophages that accumulate in the
liver, spleen, bone marrow and other organs. The disease has varying degrees of severity, from
type 1 disease with haematologic problems but no neuronal involvement, to type 2 disease,
which manifests early after birth with extensive neuronal involvement and is universally
progressive and fatal within 2 years of age. Type 3 disease is also recognised in some
classifications and also shows neurologic involvement. Previously the only useful therapy for
Gaucher disease has been administration of GCR derived from human placenta (known as
alglucerase) but more recently pharmaceutical preparations of recombinant GCR ("Ceredase"
and "Cerezyme") have shown efficacy in the treatment of type 1 disease [Niederau, C. et al
(1998) Eur. J. Med. Res. 3: 25-30].

According to the invention, the particular commercial forms of glucocerebrosidase are
examined and predicted to be particularly immunogenic because these forms are engineered to
have a high mannose oligosaccharide. Upon administration of such a protein, such as
Ceredase or Cerezyme, the non-natural protein is preferentially bound by mannose receptors
on antigen-presenting cells such as macrophages or dendritic cells. The non-natural protein is
then taken up, a portion is degraded into peptides, and the peptides are presented through MHC
Class II to the T-cell receptor. By mutating the glucocerebrosidase sequence such that derived
peptides cannot bind to MHC Class II, immunogenicity is reduced.

Others have provided GCR molecules including modified GCR [US 5,236,838] but this
teaching does not recognize the importance of T-cell epitopes to the immunogenic properties
of the protein.
Peptide sequences in human B-GCR with potential human MHC class II binding activity are:

PCIPKSFGYSSVV, KSFGYSSVVCVCN, FGYSSVVCVCNAT, SSVVCVCNATYCD, SVVCVCNATYCDS,
VCVCNATYCDSFD, ATYCDSFDPPTFP, DSFDPPTFPALGT, PTFPALGTFSRYE, PALGTFSRYESTR,
GTFSRYESTRSGR, SRYESTRSGRRME, GRRMELSMGPIQA, RRMELSMGPIQAN, RMELSMGPIQANH,
MELSMGPIQANHT, LSMGPIQANHTGT, MGPIQANHTGTGL, GPIQANHTGTGLL, TGLLLTLQPEQKF,
GLLLTLQPEQKFQ, LLLTLQPEQKFQK, LTLQPEQKFQKVK, TLQPEQKFQKVKG, PEQKFQKVKGFGG,
QKFQKVKGFGGAM, QKVKGFGGAMTDA, KGFGGAMTDAAAL, GFGGAMTDAAALN, GAMTDAAALNILA,
AMTDAAALNILAL, MTDAAALNILALS, AALNILALSPPAQ, ALNILALSPPAQN, LNILALSPPAQNL,
NILALSPPAQNLL, LALSPPAQNLLLK, ALSPPAQNLLLKS, PAQNLLLKSYFSE, AQNLLLKSYFSEE,
QNLLLKSYFSEEG, NLLLKSYFSEEGI, LLLKSYFSEEGIG, KSYFSEEGIGYNI, SYFSEEGIGYNII,
FSEEGIGYNIIRV, EGIGYNIIRVPMA, GIGYNIIRVPMAS, IGYNIIRVPMASC, YNIIRVPMASCDF,
NIIRVPMASCDFS, IIRVPMASCDFSI, IRVPMASCDFSIR, VPMASCDFSIRTY, PMASCDFSIRTYT,
SCDFSIRTYTYAD, CDFSIRTYTYADT, FSIRTYTYADTPD, RTYTYADTPDDFQ, TYTYADTPDDFQL,
YTYADTPDDFQLH, ADTPDDFQLHNFS, PDDFQLHNFSLPE, DDFQLHNFSLPEE, FQLHNFSLPEEDT,
HNFSLPEEDTKLK, FSLPEEDTKLKIP, SLPEEDTKLKIPL, EEDTKLKIPLIHR, TKLKIPLIHRALQ,
KLKIPLIHRALQL, LKIPLIHRALQLA, IPLIHRALQLAQR, PLIHRALQLAQRP, HRALQLAQRPVSL,
RALQLAQRPVSLL, ALQLAQRPVSLLA, LQLAQRPVSLLAS, RPVSLLASPWTSP, PVSLLASPWTSPT,
VSLLASPWTSPTW, SLLASPWTSPTWL, SPWTSPTWLKTNG, TSPTWLKTNGAVN, PTWLKTNGAVNGK,
TWLKTNGAVNGKG, GAVNGKGSLKGQP, GSLKGQPGDIYHQ, GDIYHQTWARYFV, DIYHQTWARYFVK,
QTWARYFVKFLDA, WARYFVKFLDAYA, ARYFVKFLDAYAE, RYFVKFLDAYAEH, YFVKFLDAYAEHK,
FVKFLDAYAEHKL, VKFLDAYAEHKLQ, KFLDAYAEHKLQF, DAYAEHKLQFWAV, YAEHKLQFWAVTA,
HKLQFWAVTAENE, LQFWAVTAENEPS, QFWAVTAENEPSA, FWAVTAENEPSAG, WAVTAENEPSAGL,
VTAENEPSAGLLS, PSAGLLSGYPFQC, AGLLSGYPFQCLG, GLLSGYPFQCLGF, SGYPFQCLGFTPE,
YPFQCLGFTPEHQ, QCLGFTPEHQRDF, LGFTPEHQRDFIA, FTPEHQRDFIARD, RDFIARDLGPTLA,
DFIARDLGPTLAN, RDLGPTLANSTHH, LGPTLANSTHHNV, PTLANSTHHNVRL, HNVRLLMLDDQRL,
VRLLMLDDQRLLL, RLLMLDDQRLLLP, LLMLDDQRLLLPH, LMLDDQRLLLPHW, DDQRLLLPHWAKV,
DQRLLLPHWAKVV, QRLLLPHWAKVVL, RLLLPHWAKVVLT, LLLPHWAKVVLTD, PHWAKVVLTDPEA,
WAKVVLTDPEAAK, AKVVLTDPEAAKY, KVVLTDPEAAKYV, VVLTDPEAAKYVH, EAAKYVHGIAVHW,
AKYVHGIAVHWYL, KYVHGIAVHWYLD, YVHGIAVHWYLDF, HGIAVHWYLDFLA, IAVHWYLDFLAPA,
VHWYLDFLAPAKA, HWYLDFLAPAKAT, WYLDFLAPAKATL, LDFLAPAKATLGE, DFLAPAKATLGET,
AKATLGETHRLFP, ATLGETHRLFPNT, GETHRLFPNTMLF, ETHRLFPNTMLFA, THRLFPNTMLFAS,
HRLFPNTMLFASE, RLFPNTMLFASEA, FPNTMLFASEACV, NTMLFASEACVGS, TMLFASEACVGSK,
MLFASEACVGSKF, ACVGSKFWEQSVR, GSKFWEQSVRLGS, SKFWEQSVRLGSW, KFWEQSVRLGSWD,
QSVRLGSWDRGMQ, VRLGSWDRGMQYS, RLGSWDRGMQYSH, GSWDRGMQYSHSI, WDRGMQYSHSIIT,
RGMQYSHSIITNL, MQYSHSIITNLLY, QYSHSIITNLLYH, YSHSIITNLLYHV, HSIITNLLYHVVG,
SIITNLLYHVVGW, TNLLYHVVGWTDW, NLLYHVVGWTDWN, LLYHVVGWTDWNL, YHVVGWTDWNLAL,
HVVGWTDWNLALN, VVGWTDWNLALNP, VGWTDWNLALNPE, TDWNLALNPEGGP, WNLALNPEGGPNW,
LALNPEGGPNWVR, PNWVRNFVDSPII, NWVRNFVDSPIIV, RNFVDSPIIVDIT, NFVDSPIIVDITK,
SPIIVDITKDTFY, PIIVDITKDTFYK, IIVDITKDTFYKQ, VDITKDTFYKQPM, DTFYKQPMFYHLG,
TFYKQPMFYHLGH, QPMFYHLGHFSKF, PMFYHLGHFSKFI, MFYHLGHFSKFIP, YHLGHFSKFIPEG,
GHFSKFIPEGSQR, SKFIPEGSQRVGL, KFIPEGSQRVGLV, IPEGSQRVGLVAS, QRVGLVASQKNDL,
VGLVASQKNDLDA, GLVASQKNDLDAV, SQKNDLDAVALMH, NDLDAVALMHPDG, DAVALMHPDGSAV,
VALMHPDGSAVVV, ALMHPDGSAVVVV, SAVVVVLNRSSKD, AVVVVLNRSSKDV, VVVVLNRSSKDVP,
VVVLNRSSKDVPL, VVLNRSSKDVPLT, KDVPLTIKDPAVG, VPLTIKDPAVGFL, PLTIKDPAVGFLE,
LTIKDPAVGFLET, PAVGFLETISPGY, VGFLETISPGYSI, GFLETISPGYSIH, FLETISPGYSIHT,
ETISPGYSIHTYL, PGYSIHTYLWHRQ, PGYSIHTYLWRRQ

EXAMPLE 11: De-immunized forms of Fc-IL2

Non-deimmunized Fc-IL2 was described e.g. in WO 96/08570. Specific de-immunized Fc-IL2
forms: Fcγ1-IL2, Fcγ2-IL2, both forms preferably with linker peptide and optionally
modified Fc domain having reduced affinity to Fc-receptors.

EXAMPLE 12: De-immunized forms of Fc-IL12

Non-deimmunized Fc-IL12 was described e.g. in WO 99/29732. Specific de-immunized Fc-IL12
forms: Fcγ1-IL12, Fcγ2-IL12, both forms preferably with linker peptide and
optionally modified Fc domain having reduced affinity to Fc-receptors.

EXAMPLE 13: De-immunized forms of Fc-TNFα

Non-deimmunized Fc-TNFα was described e.g. in WO 99/43713. Specific de-immunized Fc-TNFα
forms: Fcγ1-TNFα, Fcγ2-TNFα, both forms preferably with linker peptide and
optionally modified Fc domain having reduced affinity to Fc-receptors.

EXAMPLE 14: De-immunized forms of Fc-GM-CSF

Non-deimmunized Fc-GM-CSF was described e.g. in WO 99/43713 and WO 01/07081.
Specific de-immunized Fc-GM-CSF forms: Fcγ1-GM-CSF, Fcγ2-GM-CSF, both forms
preferably with linker peptide and optionally modified Fc domain having reduced affinity to
Fc-receptors.

EXAMPLE 15: De-immunized forms of Fc-subtilisin

Specific de-immunized Fc-subtilisin forms: Fcγ1-subtilisin, Fcγ2-subtilisin, both forms
preferably with linker peptide and optionally modified Fc domain having reduced affinity to
Fc-receptors.
EXAMPLE 16: De-immunized forms of Fc-insulin

Specific de-immunized Fc-insulin forms: Fcγ1-insulin, Fcγ2-insulin, both forms preferably
with linker peptide and optionally modified Fc domain having reduced affinity to Fc-receptors.

EXAMPLE 17: De-immunized forms of Fc-PSMA

Non-deimmunized Fc-PSMA was described e.g. in WO 96/08570 and WO 01/0708. Specific
de-immunized Fc-PSMA forms: Fcγ1-PSMA, Fcγ2-PSMA, both forms preferably with
linker peptide and optionally modified Fc domain having reduced affinity to Fc-receptors.

EXAMPLE 18: De-immunized fusion proteins comprising anti-EGFR antibodies fused to a cytokine

Humanized and murine monoclonal antibody 425 (hMAb 425, US 5,558,864; EP 0531 472),
murine and chimeric monoclonal antibody 225 (cMAb 225, US 4,943,533 and EP 0359 282)
and murine and humanized MAb 4D5 (hMAb 4D5 = Herceptin®) have been de-immunized
according to the invention and fused to a de-immunized IL-2 or a non-modified IL-2.

Fusions of antibodies to cytokines represent a situation where the need to reduce
immunogenicity is particularly great. Normally, therapeutic antibodies can induce
anti-idiotype antibodies that neutralize the effectiveness of a therapeutic antibody. This is
particularly true when a therapeutic antibody is administered at low or medium levels, as
opposed to very high levels where tolerance can be induced. For example, the therapeutic
antibodies Herceptin and Rituxan are generally given in high doses of a few hundred
milligrams. In contrast, antibody-cytokine fusions are generally given in a lower dose on the
order of a few milligrams. Thus, the dose of an antibody-cytokine fusion is in the range that
tends to promote formation of anti-idiotype antibodies. The presence of the linked cytokine
tends to exaggerate the immunogenicity of the already immunogenic antibodies.

Antibody 425 is a non-human antibody which is directed to antigen EGF-R and reacts with
colon cancer cells. This antibody has been fused to IL-2, as described in Example 13. The
presence of IL-2 or another cytokine enhances the immunogenicity of the antibody, in
particular the V regions.

In the following paragraphs the invention is described in more detail for the monoclonal
anti-EGFR antibody 425-IL2 construct, which was shown to have a high therapeutic value.
However, the invention is not limited to this antibody and said construct and its several
existing forms, but can be extended to other anti-EGFR antibodies and their fusion constructs,
preferably cytokine fusion immunoglobulins, above all chimeric antibody 225 (c225-IL-2),
which has very similar properties. In principle, non-human, chimeric or humanized versions
of the anti-EGFR antibodies can be used to synthesize said IL-2 fusion molecules.

Unless stated otherwise all amino acids in the variable heavy and light chains are numbered as
in Kabat et al, 1991 (Sequences of Proteins of Immunological Interest, US Department of
Health and Human Services). Potential T-cell epitopes are numbered with the linear number of
the first amino acid of an epitope, counting from the first amino acid of the heavy and light
chains.

1. Comparison with Mouse Subgroup Frameworks

The amino acid sequences of murine 425 VH (heavy chain) and VK (light chain) were
compared to consensus sequences for the Kabat murine heavy and light chain subgroups. 425
VH can be assigned to mouse heavy chains subgroup IIB. The comparison with the consensus
sequence for this subgroup shows that the serine at position 94 in 425 VH is unusual. The
most common residue at this position is arginine. 425 VK can be assigned to mouse kappa
chains subgroup VI. The comparison with the consensus sequence for this subgroup shows
that the residues at positions 45-47, 60 and 100 in 425 VK are unusual for this subgroup.
Amino acid residue numbering is as per Kabat.

2. Comparison with Human Frameworks

The amino acid sequences of murine 425 VH (variable heavy chain) and VK (variable kappa
light chain) were compared to the sequences of the directory of human germline VH
(Tomlinson, I.M. et al., (1992) J. Mol. Biol. 227: 776-798) and VK (Cox, J.P.L. et al., (1994)
Eur. J. Immunol. 24: 827-836) sequences and also to human germline J region sequences
(Routledge, E.G. et al., in Protein Engineering of Antibody Molecules for Prophylactic and
Therapeutic Applications in Man, Clark, M. ed., Academic Titles, Nottingham, UK, pp 13-44,
1991). The murine 425 sequence of the heavy and light chain can be taken, for example, from
EP 0531 472.

The reference human framework selected for 425 VH was VH1GRR with human JH6. The
sequence of VH1GRR in the directory ends at residue 88. Therefore there is no corresponding
residue for the unusual serine at position 94 of the murine sequence. This germline sequence
has been found in a rearranged mature antibody gene with 4 amino acid changes. The
reference human framework selected for 425 VK was L6/vg with human JK2. This germline
sequence has been found in a rearranged mature antibody heavy chain with no amino acid
changes.
3. Design of "veneered" sequences

Following identification of the reference human framework sequences, certain non-identical
amino acid residues within the 425 VH and VK frameworks were changed to the
corresponding amino acid in the human reference framework sequence. Residues which are
considered to be critical for antibody structure and binding were excluded from this process
and not altered. The murine residues that were retained at this stage are largely non-surface,
buried residues, apart from, for instance, residues at the N-terminus which are close to the
CDRs in the final antibody (1-8, preferably 1-5 amino acid residues). This process produces
a sequence that is broadly similar to a "veneered" antibody as the surface residues are mainly
human and the buried residues are as in the original murine sequence.

4. Peptide Threading Analysis

The murine and veneered 425 VH and VK sequences were analyzed using the method
according to the invention. The amino acid sequences are divided into all possible 13mers.
The 13mer peptides are sequentially presented to the models of the binding groove of the
HLA-DR allotypes and a binding score is assigned to each peptide for each allele. A
conformational score is calculated for each pocket-bound side chain of the peptide. This score
is based on steric overlap, potential hydrogen bonds between peptide and residues in the
binding groove, electrostatic interactions and favorable contacts between peptide and pocket
residues. The conformation of each side chain is then altered and the score recalculated.
Having determined the highest conformational score, the binding score is then calculated
based on the groove-bound hydrophobic residues, the non-groove hydrophilic residues and the
number of residues that fit into the binding groove. Peptides which are known binders to
human MHC Class II achieve a high binding score with almost no false negatives. Thus
peptides that achieve a significant binding score in the current analysis are considered to be
potential T-cell epitopes. The results of the peptide threading analysis are shown in Table 1 for
425 VH and 425 VK. Potential T-cell epitopes are referred to by the linear number of the first
residue of the 13mer.
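
The windowing and numbering convention used in this analysis can be illustrated with a minimal Python sketch. It only enumerates the overlapping 13mers of a sequence and labels each with the 1-based linear number of its first residue; it does not attempt the conformational or binding-score calculation, which depends on the HLA-DR groove models. The function name `enumerate_13mers` is an illustrative assumption, and the 16-residue test fragment is simply the stretch implied by the overlapping EPO peptides PRLICDSRVLERY and ICDSRVLERYLLE listed in Example 5.

```python
def enumerate_13mers(sequence, window=13):
    """Return (start, peptide) pairs for every overlapping window.

    `start` is the 1-based linear number of the first residue of the
    window, matching the way potential epitopes are referred to in the text.
    """
    seq = sequence.replace(" ", "").upper()
    # A sequence of length n yields n - window + 1 windows (none if n < window).
    return [(i + 1, seq[i:i + window]) for i in range(len(seq) - window + 1)]


# Hypothetical usage on a short fragment (assumption: chosen only for illustration).
fragment = "PRLICDSRVLERYLLE"
for start, peptide in enumerate_13mers(fragment):
    print(start, peptide)
```

On this reading, a protein of 165 residues such as EPO yields 165 - 13 + 1 = 153 candidate 13mers, each of which would then be scored against every allotype model.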

Table 1: Potential T-cell epitopes in murine and veneered 425 sequences

Sequence | Number of potential T-cell epitopes | Number of first residue of 13mer with number of binding alleles in brackets
Murine 425 VH | 8 | 31(7), 35(17), 43(7), 46(8), 58(10), 62(12), 81(11), 84(16)
Veneered 425 VH | 7 | 31(7), 43(7), 46(8), 58(10), 62(11), 81(11), 84(16)
Murine 425 VK | 9 | 1(8), 2(5), 17(5), 27(5), 43(16), 72(18), 75(10), 92(10), 93(17)
Veneered 425 VK | 4 | 27(5), 43(16), 92(8), 93(17)
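
As a reading aid for Table 1, the following Python sketch parses the "position(alleles)" entries and lists which potential epitopes are absent after veneering. The values are copied from Table 1; the dictionary layout and the function names `parse_epitopes` and `removed_by_veneering` are assumptions made for illustration and are not part of the described method.

```python
import re

# Entries copied from Table 1: first residue of 13mer, binding alleles in brackets.
TABLE_1 = {
    "Murine 425 VH": "31(7), 35(17), 43(7), 46(8), 58(10), 62(12), 81(11), 84(16)",
    "Veneered 425 VH": "31(7), 43(7), 46(8), 58(10), 62(11), 81(11), 84(16)",
    "Murine 425 VK": "1(8), 2(5), 17(5), 27(5), 43(16), 72(18), 75(10), 92(10), 93(17)",
    "Veneered 425 VK": "27(5), 43(16), 92(8), 93(17)",
}


def parse_epitopes(cell):
    """Map first-residue number -> number of binding alleles."""
    return {int(pos): int(n) for pos, n in re.findall(r"(\d+)\((\d+)\)", cell)}


def removed_by_veneering(chain):
    """Epitope start positions present in the murine but not the veneered chain."""
    murine = parse_epitopes(TABLE_1[f"Murine 425 {chain}"])
    veneered = parse_epitopes(TABLE_1[f"Veneered 425 {chain}"])
    return sorted(set(murine) - set(veneered))


for chain in ("VH", "VK"):
    print(chain, removed_by_veneering(chain))
```

With the Table 1 values this reports position 35 for VH and positions 1, 2, 17, 72 and 75 for VK, consistent with the drop from 8 to 7 and from 9 to 4 potential epitopes.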



5. Removal of Potential T Cell Epitopes 

The numbering of amino acid residues for substitution is as per Kabat. Potential T-cell
epitopes are referred to by the linear number of the first residue of the 13mer.

The amino acid substitutions required to remove the potential T-cell epitopes from the
veneered 425 heavy chain variable region were as follows:

• Substitution of proline for alanine at residue 41 (Kabat number 41) removes the potential
epitope at residue number 31.

• Substitution of proline for leucine at residue 45 (Kabat number 45) removes the
potential epitope at residue number 43. A proline at position 45 is found in a human
germline VH sequence, DP52.

• Substitution of alanine for isoleucine at residue 48 (Kabat number 48) removes the
potential epitope at residue number 46.

• Substitution of valine for alanine at residue 68 (Kabat number 67) removes the potential
epitope at residue number 58.

• Substitution of isoleucine for leucine at residue 70 (Kabat number 69) removes the
potential epitope at residue number 62.

• Substitution of threonine for serine at residue 91 (Kabat number 87) removes the
potential epitopes at residue numbers 81 and 84.

The amino acid substitutions required to remove the potential T-cell epitopes from the
veneered 425 light chain variable region were as follows:

• Substitution of histidine for tyrosine at residue 35 (Kabat number 36) removes the
potential epitope at residue number 27.

• Substitution of alanine for threonine at residue 50 (Kabat number 51) removes the
potential epitope at residue number 43. This residue is within CDR2. Alanine is
commonly found at this position in both human and murine antibodies. An alternative
substitution to eliminate this epitope is alanine for leucine at position 45 (Kabat number
46). There is no conservative substitution that will eliminate the potential epitope. Alanine
is found at this position in some antibodies.

• Substitution of proline for isoleucine at residue 94 (Kabat number 95) removes the
potential epitope at residue number 92. Kabat residue 95 is within CDRL3. Proline is
common at this position in mouse antibody sequences and there is no change outwith the
CDR that eliminates the potential epitope.

• Substitution of valine for leucine at residue 103 (Kabat number 104) removes the potential
epitope at residue number 93.

6. Design of de-immunized Sequences

De-immunized heavy and light chain variable region sequences were designed with reference
to the changes required to remove potential T-cell epitopes and consideration of framework
residues that might be critical for antibody structure and binding. In addition to the
de-immunized sequences based on the veneered sequence, an additional sequence was designed
for each of VH and VK based on the murine sequence, termed the Mouse Peptide Threaded
(Mo PT) version. For this version, changes were made directly to the murine sequence in order
to eliminate T-cell epitopes, but only changes outwith the CDRs that are not considered to be
detrimental to binding are made. No attempt to remove surface (B-cell) epitopes has been
made in this version of the de-immunized sequence.
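
The restriction of the Mo PT version to changes outside the CDRs can be illustrated with a small filter. This is a sketch under stated assumptions: the CDR ranges are the standard Kabat definitions rather than values quoted in this document, the function names are hypothetical, and insertion letters such as 27e or 82b are handled by looking only at the numeric part. Whether a framework change is detrimental to binding is a separate judgment that this sketch does not model.

```python
import re

# Standard Kabat CDR boundaries (inclusive); an assumption, not quoted from the text.
KABAT_CDRS = {
    "H": {"CDR1": (31, 35), "CDR2": (50, 65), "CDR3": (95, 102)},
    "L": {"CDR1": (24, 34), "CDR2": (50, 56), "CDR3": (89, 97)},
}


def kabat_base_number(position):
    """Return the numeric part of a Kabat position such as 51, '27e' or '82b'."""
    match = re.match(r"\d+", str(position))
    if match is None:
        raise ValueError(f"not a Kabat position: {position!r}")
    return int(match.group())


def cdr_of(chain, position):
    """Return 'CDR1', 'CDR2' or 'CDR3' if the position lies in a CDR, else None."""
    number = kabat_base_number(position)
    for name, (start, end) in KABAT_CDRS[chain].items():
        if start <= number <= end:
            return name
    return None


def framework_only(chain, substitutions):
    """Keep only substitutions whose Kabat position lies outside every CDR."""
    return [s for s in substitutions if cdr_of(chain, s[0]) is None]


if __name__ == "__main__":
    # Light chain substitutions from Section 5: (Kabat number, from, to).
    light_chain = [(36, "Y", "H"), (51, "T", "A"), (95, "I", "P"), (104, "L", "V")]
    print(framework_only("L", light_chain))
```

With the light chain list from Section 5, the filter drops the changes at Kabat 51 and 95, which the text itself places in CDR2 and CDRL3.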

The primary de-immunized VH includes substitutions 1 to 6 in Section 5 above and includes
no potential T-cell epitopes. A further 4 de-immunized VH sequences were designed in order
to test the effect of the various substitutions required on antibody binding. The cumulative
alterations made to the primary de-immunized sequence (425 VH1GRR-VH-v1) and the
potential T-cell epitopes remaining are detailed in Table 2. The mouse threaded version is
included for comparison.
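
Because the residue changes in Table 2 are cumulative, each version can be derived from the previous one. The sketch below illustrates this under a few assumptions: the starting sequence is the de-immunized VH1 entry of Table 5 with spacing removed, the Kabat-to-linear map covers only the positions named in Section 5, and the function and variable names are hypothetical.

```python
# De-immunized 425 VH1 from Table 5, spaces removed.
VH1 = (
    "QVQLVQSGAELVKPGASVKLSCKASGYTFTSHWMHWVKQAPGQGPEWAGEFNPSNGRTNYNEKFKSRVT"
    "ITVDKSTSTAYMQLSSLTSEDTAVYYCASRDYDYDGRYFDYWGQGTTLTVSS"
)

# Kabat number -> linear residue number, taken from the list in Section 5.
KABAT_TO_LINEAR = {41: 41, 45: 45, 48: 48, 67: 68, 69: 70}

# Changes introduced by each later version: (Kabat number, from, to).
STEPS = [
    ("v2", [(48, "A", "I")]),
    ("v3", [(45, "P", "L")]),
    ("v4", [(67, "V", "A"), (69, "I", "L")]),
    ("v5", [(41, "P", "A")]),
]


def build_cumulative_variants(primary, steps, kabat_to_linear):
    """Apply each step on top of the previous version and return all versions."""
    variants = {"v1": primary}
    current = primary
    for name, changes in steps:
        residues = list(current)
        for kabat, expected, replacement in changes:
            index = kabat_to_linear[kabat] - 1
            if residues[index] != expected:
                raise ValueError(
                    f"{name}: expected {expected} at Kabat {kabat}, "
                    f"found {residues[index]}"
                )
            residues[index] = replacement
        current = "".join(residues)
        variants[name] = current
    return variants


if __name__ == "__main__":
    for name, sequence in build_cumulative_variants(VH1, STEPS, KABAT_TO_LINEAR).items():
        print(name, sequence)
```

The derived v2 and v3 sequences can be compared directly with the VH2 and VH3 entries of Table 5.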
Table 2: Amino acid changes and potential epitopes in de-immunized 425 VH

Variant             Cumulative Residue Changes   Potential T Cell Epitopes
425 VH1GRR-VH-v1    None                         None
425 VH1GRR-VH-v2    48A -> I                     46(8)
425 VH1GRR-VH-v3    45P -> L                     43(7), 46(8)
425 VH1GRR-VH-v4    67V -> A, 69I -> L           43(7), 46(8), 58(10), 62(11)
425 VH1GRR-VH-v5    41P -> A                     31(7), 43(7), 46(8), 58(10), 62(11)
425 VH-MoPT         NA                           43(7), 46(8)

The primary de-immunized VK includes substitutions 1 to 4 in Section 5 above and includes
no potential T-cell epitopes. A further 4 de-immunized VK sequences were designed in order
to test the effect of the various substitutions required on antibody binding. Version 2 is an
alternative to Version 1 in which an alternative substitution has been used to remove the same
potential T-cell epitope. The cumulative alterations made to the primary de-immunized
sequence (425 L6-vg-VK-v1) and the potential T-cell epitopes remaining are detailed in Table
3. The mouse threaded version is included for comparison.

Table 3: Amino acid changes and potential epitopes in de-immunized 425 VK

Variant             Cumulative Residue Changes   Potential T-cell Epitopes
425 L6-vg-VK-v1     None                         None
425 L6-vg-VK-v2     51A -> T, 46L -> A           None
425 L6-vg-VK-v3     46A -> L                     43(16)
425 L6-vg-VK-v4     95P -> I                     43(16), 92(8)
425 L6-vg-VK-v5     36H -> Y                     27(5), 43(16), 92(8)
425 VK-MoPT         NA                           27(5), 43(16), 92(8)



Table 4: original and "veneered" sequences of VH and VK of murine MAb 425

425 VH mouse
QVQLQQPGAELVKPGASVKLSCKASGYTFTSHWMHWVKQRAGQGLEWIGEFNPSNGRTNYNEKFKSKAT
LTVDKSSSTAYMQLSSLTSEDSAVYYCASRDYDYDGRYFDYWGQGTTLTVSS

425 VK mouse
QIVLTQSPAIMSASPGEKVTMTCSASSSVTYMYWYQQKPGSSPRLLIYDTSNLASGVPVRFSGSGSGTS
YSLTISRMEAEDAATYYCQQWSSHIFTFGSGTKLEIK

425 VH veneered:
QVQLVQSGAELVKPGASVKLSCKASGYTFTSHWMHWVKQAAGQGLEWIGEFNPSNGRTNYNEKFKSRAT
LTVDKSTSTAYMQLSSLTSEDSAVYYCASRDYDYDGRYFDYWGQGTTLTVSS

425 VK veneered:
QIVLTQSPATLSASPGERATMSCSASSSVTYMYWYQQKPGQSPRLLIYDTSNLASGVPARFSGSGSGTS
YTLTISSLEAEDAATYYCQQWSSHIFTFGQGTKLEIK

Table 5: De-immunized sequences of variable heavy and light chain of MAb 425

425 de-immunized VH1
QVQLVQSGAELVKPGASVKLSCKASGYTFTSHWMHWVKQAPGQGPEWAGEFNPSNGRTNYNEKFKSRVT
ITVDKSTSTAYMQLSSLTSEDTAVYYCASRDYDYDGRYFDYWGQGTTLTVSS

425 de-immunized VK1
QIVLTQSPATLSASPGERATMSCSASSSVTYMYWHQQKPGQSPRLLIYDASNLASGVPARFSGSGSGTS
YTLTISSLEAEDAATYYCQQWSSHPFTFGQGTKVEIK

425 de-immunized VH2
QVQLVQSGAELVKPGASVKLSCKASGYTFTSHWMHWVKQAPGQGPEWIGEFNPSNGRTNYNEKFKSRVT
ITVDKSTSTAYMQLSSLTSEDTAVYYCASRDYDYDGRYFDYWGQGTTLTVSS

425 de-immunized VK2
QIVLTQSPATLSASPGERATMSCSASSSVTYMYWHQQKPGQSPRALIYDTSNLASGVPARFSGSGSGTS
YTLTISSLEAEDAATYYCQQWSSHPFTFGQGTKVEIK

425 de-immunized VH3
QVQLVQSGAELVKPGASVKLSCKASGYTFTSHWMHWVKQAPGQGLEWIGEFNPSNGRTNYNEKFKSRVT
ITVDKSTSTAYMQLSSLTSEDTAVYYCASRDYDYDGRYFDYWGQGTTLTVSS

425 de-immunized VK3
QIVLTQSPATLSASPGERATMSCSASSSVTYMYWHQQKPGQSPRLLIYDTSNLASGVPARFSGSGSGTS
YTLTISSLEAEDAATYYCQQWSSHPFTFGQGTKVEIK

425 de-immunized VH4
QVQLVQSGAELVKPGASVKLSCKASGYTFTSHWMH^
LTVDKSTSTAYMQLSSLTSEDTAVYYCASRDYDYDGRYFDYWGQGTTLTVSS

425 de-immunized VK4
QIVLTQSPATLSASPGERATMSCSASSSVTYMYWHQQKPGQSPRLLIYDTSNLASGVPARFSGSGSGTS
YTLTISSLEAEDAATYYCQQWSSHIFTFGQGTKVEIK

425 de-immunized VH5
QVQLVQSGAELVKPGASVKLSCKASGYTFTSHWMHWVKQAAGQGLEWIGEFNPSNGRTimJEKFKSRAT
LTVDKSTSTAYMQLSSLTSEDTAVYYCASRDYDYDGRYFDYWGQGTTLTVSS

425 de-immunized VK5
QIVLTQSPATLSASPGERATMSCSASSSVTYMYWYQQKPGQSPRLLIYDTSNLASGVPARFSGSGSGTS
YTLTISSLEAEDAATYYCQQWSSHIFTFGQGTKVEIK

425 VH mouse, peptide threaded (Mo PT)
QVQLQQPGAELVKPGASVKLSCKASGYTFTSHW^WVKQAPGQGLEWIGEFNPSNGRTNYNEKFKSRVT
ITVDKSSSTAYMQLSSLTSEDTAVYYCASRDYDYDGRYFDYWGQGTTLTVSS

425 VK mouse, peptide threaded (Mo PT)
QIVLTQSPATLSASPGEKATMTCSASSSVTYMYWYQQKPGSSPRLLIYDTSNLASGVPVRFSGSGSGTS
YSLTISRLEAEDAATYYCQQWSSHIFTFGQGTKVEIK

As already mentioned, the modified anti-EGFR antibody - cytokine constructs according to
the invention, preferably MAb 425 - IL2, can be used in pharmaceutical compositions and
pharmaceutical kits, preferably for the treatment of cancer. "Cancer" and "tumor" refer to or
describe the physiological condition in mammals that is typically characterized by unregulated
cell growth. By means of the pharmaceutical compositions according to the present invention,
tumors can be treated such as tumors of the breast, heart, lung, small intestine, colon, spleen,
kidney, bladder, head and neck, ovary, prostate, brain, pancreas, skin, bone, bone marrow,
blood, thymus, uterus, testicles, cervix, and liver.

In analogy to antibody 425, similar fusion constructs can be obtained using monoclonal
antibody 225 in murine, chimeric or humanized forms.

EXAMPLE 19: De-immunized forms of 14.18 antibody - IL2 and KS-1/4 - IL2

The cytokine interleukin 2 (IL-2) has been fused to specific monoclonal antibodies KS-1/4 and
ch14.18 directed to the tumor associated antigens epithelial cell adhesion molecule (Ep-CAM,
KSA, KS1/4 antigen) and the disialoganglioside GD2, respectively, to form the fusion proteins
ch14.18-IL-2 and KS1/4-IL-2, respectively (US 5,650,150, EP 0 338 767). These antibodies
have been de-immunized according to the invention and fused to an immunogenically modified
IL-2 or a non-modified IL-2.
Anti-EpCAM antibody KS 1/4 

The monoclonal antibody KS1/4 is a murine antibody that specifically binds to the 40,000
dalton cell surface antigen EpCAM (epithelial cell adhesion molecule) found in high density
on adenocarcinoma cells and also found at much lower levels on certain normal epithelial
cells. This antibody has been shown to be effective for the detection of disease.

A variety of fusions of KS-1/4 to single and combined cytokines, such as IL-2 and IL-12, have
been described (WO98/25978, WO01/58957A, and WO 01/10912). These fusion proteins are
effective in animal models of cancer. However, due to the presence of cytokines, these fusion
proteins are particularly immunogenic. There is a need for altered KS antibody molecules with
a reduced propensity to elicit an immune response on administration to the human host.
Modified sequences in MAb KS1/4 providing a modified KS antibody according to the
invention are shown below. A mutated form of KS-1/4 in which the T-cell epitopes in the
V regions were completely removed by mutation, as defined by the criteria given above in the
section on computer algorithms, was efficiently expressed in mammalian cells and bound to
the EpCAM antigen with only about an 8-fold reduction of affinity. This molecule was
termed VHv1/VKv1. A second antibody, VHv2/VKv1, had only about a 3-fold reduction in
affinity and differed from VHv1/VKv1 by a single amino acid substitution. These two
antibodies have been expressed in mammalian cells as KS-IL2 fusion proteins. The
KS(VHv1/VKv1)-IL2 and KS(VHv2/VKv1) are the most preferred embodiments of the
invention with respect to treatment of a broad spectrum of human cancers by immune therapy.

1 Comparison with Mouse Subgroup Frameworks 

The amino acid sequences of murine KS VH and VK were compared to consensus sequences
for the Kabat murine heavy and light chain subgroups (Kabat et al., 1991). Murine KS VH
cannot be assigned to any one Subgroup, but is closest to Subgroup II(A) and V(A). Unusual
residues are found at position 2 which is normally valine, 46 which is normally glutamic acid,
and 68 which is normally threonine. Residue 69 is more commonly leucine or isoleucine. At
82b, serine is most often found. Murine KS VK can be assigned to Subgroup VI (Figure 2).
Unusual residues are found at 46 and 47 which are commonly both leucine. Residue 58 is
unusual with either leucine or valine normally found at this position.

2 Comparison with Human Frameworks 

The amino acid sequences of murine KS VH and VK were compared to the sequences of the
directory of human germline VH (Tomlinson et al., 1992) and VK (Cox et al., 1994)
sequences and also to human germline J region sequences (Routledge et al., 1993). The
reference human framework selected for KS VH was DP10 with human JH6. This germline
sequence has been found in a rearranged mature antibody gene with no amino acid changes.
The reference human framework selected for KS VK was B1. For framework 2 the sequence
of the mature human antibody IMEV was used (in Kabat et al., 1991). This sequence is
identical to the murine sequence immediately adjacent to CDR2. The J region sequence was
human JK4. This germline sequence has not been found as rearranged mature antibody light
chain.

3 Design of Veneered Sequences 

Following identification of the reference human framework sequences, certain non-identical
amino acid residues within the 425 VH and VK frameworks were changed to the
corresponding amino acid in the human reference sequence. Residues which are considered to
be critical for antibody structure and binding were excluded from this process and not altered.
The murine residues that were retained at this stage are largely non-surface, buried residues,
apart from residues at the N-terminus for instance, which are close to the CDRs in the final
antibody. This process produces a sequence that is broadly similar to a "veneered" antibody as
the surface residues are mainly human and the buried residues are as in the original murine
sequence.
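
The veneering step described above amounts to copying the human reference residue into every mismatched framework position that is not on a protected list. The following is a toy sketch only: it assumes the murine and human sequences are already aligned and of equal length, the function name and the short example sequences are hypothetical, and the choice of which positions to protect (CDRs, buried residues, residues near the CDRs) is left to the caller.

```python
def veneer(murine, human, protected_positions):
    """Replace mismatched murine residues with human ones, except at protected
    1-based positions, which keep the murine residue."""
    if len(murine) != len(human):
        raise ValueError("sequences must be aligned to the same length")
    result = []
    for position, (m, h) in enumerate(zip(murine, human), start=1):
        if m != h and position not in protected_positions:
            result.append(h)  # surface-type position: adopt the human residue
        else:
            result.append(m)  # identical or protected: keep the murine residue
    return "".join(result)


if __name__ == "__main__":
    # Hypothetical 10-residue fragments, for illustration only.
    murine_fragment = "QIQLVQSGPE"
    human_fragment = "QVQLVQSGAE"
    # Position 2 is protected here to mimic an N-terminal residue kept murine.
    print(veneer(murine_fragment, human_fragment, protected_positions={2}))
```

In this toy case the mismatch at position 2 is retained because it is protected, while the mismatch at position 9 is replaced with the human residue.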

4 Peptide Threading Analysis

The murine and veneered KS VH and VK sequences were analyzed using the method
according to the invention. The amino acid sequences are divided into all possible 13-mers.
The 13-mer peptides are sequentially presented to the models of the binding groove of the
HLA-DR allotypes and a binding score assigned to each peptide for each allele. A
conformational score is calculated for each pocket-bound side chain of the peptide. This score
is based on steric overlap, potential hydrogen bonds between peptide and residues in the
binding groove, electrostatic interactions and favorable contacts between peptide and pocket
residues. The conformation of each side chain is then altered and the score recalculated.
Having determined the highest conformational score, the binding score is then calculated
based on the groove-bound hydrophobic residues, the non-groove hydrophilic residues and
the number of residues that fit into the binding groove. Known binders to MHC class II
achieve a significant binding score with almost no false negatives. Thus peptides achieving a
significant binding score from the current analysis are considered to be potential T-cell
epitopes. The results of the peptide threading analysis for the murine and veneered sequences
are shown in Table 1.
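
The bookkeeping part of this analysis, dividing a sequence into all overlapping 13-mers and counting for each start position how many allotypes give a significant score, can be sketched as follows. This is an illustration only: the scoring function is a caller-supplied placeholder, `toy_score` merely counts hydrophobic residues and is not the conformational scoring described above, and the allele names and threshold are hypothetical.

```python
from typing import Callable, Dict, Iterator, Sequence, Tuple

WINDOW = 13  # peptide length used in the analysis


def iter_13mers(sequence: str, window: int = WINDOW) -> Iterator[Tuple[int, str]]:
    """Yield (1-based start position, peptide) for every overlapping window."""
    for start in range(len(sequence) - window + 1):
        yield start + 1, sequence[start:start + window]


def count_binders(
    sequence: str,
    alleles: Sequence[str],
    score: Callable[[str, str], float],
    threshold: float,
) -> Dict[int, int]:
    """Map each start position to the number of alleles scoring at or above
    the threshold; positions with no binding allele are omitted."""
    hits: Dict[int, int] = {}
    for position, peptide in iter_13mers(sequence):
        binders = sum(1 for allele in alleles if score(peptide, allele) >= threshold)
        if binders:
            hits[position] = binders
    return hits


def toy_score(peptide: str, allele: str) -> float:
    """Placeholder score: number of hydrophobic residues, ignoring the allele."""
    return float(sum(1 for residue in peptide if residue in "AVLIMFWY"))


if __name__ == "__main__":
    demo = "G" * 13 + "L" * 13
    print(count_binders(demo, ["allele_a", "allele_b"], toy_score, threshold=13))
```

The returned mapping has the same shape as the entries in Table 1, where each potential epitope is written as a start position followed by the number of binding allotypes in parentheses.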

Table 1: Potential T-cell epitopes in murine and veneered KS sequences

Sequence          Number of potential   Location of potential epitopes
                  T-cell epitopes       (no. of potential MHC binders)
Murine KS VH      6                     35(11), 62(17), 78(12), 81(12), 89(6), 98(15)
Veneered KS VH    5                     30(7), 62(15), 78(11), 89(6), 98(15)
Murine KS VK      6                     1(14), 2(5), 17(5), 27(5), 51(13), 72(18)
Veneered KS VK    3                     1(17), 27(5), 51(13)
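
The position(binders) notation used in Table 1 is easy to turn into a small data structure, which makes it simple to compare two sequences. The sketch below is illustrative; the function name is hypothetical, and the two input strings are the murine and veneered KS VK entries of Table 1.

```python
import re


def parse_epitopes(cell):
    """Parse a table cell such as '35(11), 62(17)' into {35: 11, 62: 17}.

    Cells without any position(binders) entry, such as 'none' or 'NA',
    give an empty dictionary.
    """
    pairs = re.findall(r"(\d+)\s*\(\s*(\d+)\s*\)", cell)
    return {int(position): int(binders) for position, binders in pairs}


if __name__ == "__main__":
    murine_vk = parse_epitopes("1(14), 2(5), 17(5), 27(5), 51(13), 72(18)")
    veneered_vk = parse_epitopes("1(17), 27(5), 51(13)")
    removed = sorted(set(murine_vk) - set(veneered_vk))
    print(removed)
```

Comparing the key sets shows which potential epitopes disappear as a side effect of veneering, before any dedicated epitope-removing substitution is made.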



5 Removal of Potential T Cell Epitopes 

Potential T-cell epitopes are removed by making amino acid substitutions in the particular
peptide that constitutes the epitope. Substitutions were made by inserting amino acids of
similar physicochemical properties if possible. However, in order to remove some potential
epitopes, amino acids of different size, charge or hydrophobicity may need to be substituted. If
changes have to be made within CDRs which might have an effect on binding, there is then a
need to make a variant with and without the particular amino acid substitution. Numbering of
amino acid residues for substitution is as per Kabat. Potential T Cell epitopes are referred to by
the linear number of the first residue of the 13mer.
The amino acid changes required to remove T-cell epitopes from the veneered KS heavy chain
variable region were as follows:

1. Substitution of arginine for lysine at residue 38 (Kabat number 38) removes the potential
epitope at residue no 30.

2. Substitution of alanine for leucine at residue 72 (Kabat number 71) and isoleucine for
phenylalanine at residue 70 (Kabat number 69) removes the potential epitope at residue 62. An
isoleucine at Kabat number 69 and alanine at Kabat number 71 is found in a human germline
VH sequence, DP10.

3. Substitution of leucine for alanine at residue 79 (Kabat number 78) removes the
potential epitope at residue number 78.

4. Substitution of threonine for methionine at residue 91 (Kabat number 87) removes the
potential epitope at residue number 89.

5. Substitution of methionine for isoleucine at residue 100 (Kabat number 96) in CDRH3
removes the potential epitope at residue 98. There is no change outwith CDRH3 which
removes this potential epitope.

The amino acid substitutions required to remove the potential T-cell epitopes from the
veneered KS light chain variable region were as follows:

1. Substitution of isoleucine for methionine at residue 32 (Kabat number 33) removes the
potential epitope at residue number 27. This residue is within CDR2. Isoleucine is
commonly found at this position in human antibodies.

2. The potential epitope at position 1 is removed by substituting valine for leucine at residue
(Kabat number 3).

3. Substitution of serine for alanine at residue 59 (Kabat number 60) removes the potential
epitope at residue number 51.
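
The preference stated at the start of this section, trying residues of similar physicochemical properties first and falling back to less conservative changes only if needed, can be expressed as an ordered search. This is a sketch under loud assumptions: the residue groupings are a coarse illustrative choice and are not taken from this document, the epitope test is a caller-supplied placeholder rather than the threading analysis, and all names are hypothetical.

```python
AMINO_ACIDS = "ACDEFGHIKLMNPQRSTVWY"

# Coarse physicochemical groups; an illustrative assumption only.
SIMILAR_GROUPS = ["AVLIM", "FWY", "ST", "NQ", "DE", "KRH", "G", "P", "C"]


def is_conservative(old, new):
    """True if both residues fall in the same illustrative group."""
    return any(old in group and new in group for group in SIMILAR_GROUPS)


def candidate_substitutions(peptide):
    """Return (0-based offset, old, new) single substitutions, with the
    conservative ones ordered before the non-conservative ones."""
    candidates = [
        (offset, old, new)
        for offset, old in enumerate(peptide)
        for new in AMINO_ACIDS
        if new != old
    ]
    # sorted() is stable, so the original order is kept within each class.
    return sorted(candidates, key=lambda c: not is_conservative(c[1], c[2]))


def first_removing_substitution(peptide, is_epitope):
    """Return the first substitution after which is_epitope() is False,
    or None if no single substitution removes the epitope."""
    for offset, old, new in candidate_substitutions(peptide):
        mutated = peptide[:offset] + new + peptide[offset + 1:]
        if not is_epitope(mutated):
            return offset, old, new
    return None


if __name__ == "__main__":
    # Placeholder rule: a peptide "binds" if its first residue is hydrophobic.
    def toy_is_epitope(peptide):
        return peptide[0] in "LIVMF"

    print(first_removing_substitution("LAAAAAAAAAAAA", toy_is_epitope))
```

A real workflow would also need to respect the constraints described in the text, for example avoiding changes within CDRs where possible or preparing variants with and without such a change.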

6 Design of de-immunized Sequences

De-immunized heavy and light chain sequences were designed with reference to the changes
required to remove potential T-cell epitopes and consideration of framework residues that
might be critical for antibody structure and binding. In addition to the de-immunized
sequences based on the veneered sequence, an additional sequence was designed for each of VH
and VK based on the murine sequence, termed the Mouse Peptide Threaded (MoPT) version.
For this version, changes were made directly to the murine sequence in order to eliminate
T-cell epitopes, but only changes outside the CDRs that are not considered to be detrimental to
binding are made. No attempt to remove surface (B cell) epitopes has been made in this
version of the de-immunized sequence. The primary de-immunized VH includes substitutions
1 to 5 in Section 5 above and one extra change at residue 43 (Kabat number 43). Lysine found
in the murine sequence was substituted for the glutamine from the human framework. Lysine
is positively charged and therefore significantly different to glutamine; this region may be
involved in VH/VL contacts. The primary de-immunized VH includes no potential T-cell
epitopes. A further 4 de-immunized VHs were designed in order to test the effect of the
various substitutions required on antibody binding. The cumulative alterations made to the
primary de-immunized sequence (KSDIVHv1) and the potential T-cell epitopes remaining are
detailed in Table 2.

Table 2: Amino acid changes and potential epitopes in de-immunized KS VH

Variant     Cumulative residue      Potential epitopes (no. of potential MHC binders
            changes                 from 18 tested)
KSDIVHv1    None                    none
KSDIVHv2    96M -> I                98(15)
KSDIVHv3    71A -> L, 78L -> A      62(16), 78(11), 98(15)
KSDIVHv4    38R -> K                30(7), 62(16), 78(11), 98(15)
KSDIVHv5    68T -> A, 69I -> F      30(7), 62(17), 78(11), 98(15)
KSMoPTVH    NA                      98(15), 78(12)



The primary de-immunized VK includes substitutions 1 to 3 in Section 5 above. A further 3
de-immunized VKs were designed in order to test the effect of the various substitutions
required on antibody binding. The cumulative alterations made to the primary de-immunized
sequence (KSDIVKv1) and the potential T-cell epitopes remaining are detailed in Table 3.



Table 3: Amino acid changes and potential epitopes in de-immunized KS VK

Variant     Cumulative residue      Potential epitopes (no. of potential MHC binders
            changes                 from 18 tested)
KSDIVKv1    None                    none
KSDIVKv2    33I -> M                27(5)
KSDIVKv3    3V -> L                 1(17), 27(5)
KSDIVKv4    60S -> A                1(17), 27(5), 51(13)
KSMoPTVK    NA                      none



Sequences of versions of modified epitopes:

KS VH veneered:
QIQLVQSGPELKKPGSSVKiSCKASGYTFTim5Mtf^
fTlETSTSTAYLQLNNLRsEDmATYfCVRFISKGDYWGQGTTVTVSS

KS VK veneered:
QILLTQSPASLAVSPGQRATITCSASSSVSYMLWYQQKPGQPPKPWIFDTSNLASGFPARFSGSGSGTS
YTLTINSLEAEDAATYYCHQRSGYPYTFGGGTKVEIK

KS de-immunized VH1
QIQLVQSGPELKKPGSSVKISCKASGYTFTNYGMNWVl^QAPGKGLK^
ITAETSTSTLYLQLNNLRSEDTATYFCVRFMSKGDYWGQGTTVTVSS

KS de-immunized VK1
QIVLTQSPASLAVSPGQRATITCSASSSVSYILWYQQKPGQPPKPWIFDTSNLASGFPSRFSGSGSGTS
YTLTINSLEAEDAATYYCHQRSGYPYTFGGGTKVEIK

KS de-immunized VH2
QIQLVQSGPELKKPGSSVKISCKASGYTFTITCGMNWVIIQAPGKGLKW^
ITAETSTSTLYLQLNNLRSEDTATYFCVRFISKGDYWGQGTTVTVSS

KS de-immunized VK2
QIVLTQSPASLAVSPGQRATITCSASSSVSYMLWYQQKPGQPPKPWIFDTSNLASGFPSRFSGSGSGTS
YTLTINSLEAEDAATYYCHQRSGYPYTFGGGTKVEIK

KS de-immunized VH3
QIQLVQSGPELKKPGSSVTCISCKASGYTFTNYGMNWV^^
ITLETSTSTAYLQLNNLRSEDTATYFCVRFISKGDYWGQGTTVTVSS

KS de-immunized VK3
QILLTQSPASLAVSPGQRATITCSASSSVSYMLWYQQKPGQPPKPWIFDTSNLASGFPSRFSGSGSGTS
YTLTINSLEAEDAATYYCHQRSGYPYTFGGGTKVEIK

KS de-immunized VH4
QIQLVQSGPELKKPGSSVKISCKASGYTFTNYGMNWVXQAPGKGLKWMGWINTYTGEPTYADDFKGRFT
ITLETSTSTAYLQLNNLRSEDTATYFCVRFISKGDYWGQGTTVTVSS

KS de-immunized VK4
QILLTQSPASLAVSPGQRATITCSASSSVSYMLWYQQKPGQPPKPWIFDTSNLASGFPARFSGSGSGTS
YTLTINSLEAEDAATYYCHQRSGYPYTFGGGTKVEIK

KS de-immunized VH5
QIQLVQSGPELKKPGSSVKISCKASGYTFTNYGMNWVKQAPGKGLKWMGWINTYTGEPTYADDFKGRFA
FTLETSTSTAYLQLNNLRSEDTATYFCVRFISKGDYWGQGTTVTVSS

KS de-immunized VK5
QILLTQSPASLAVSPGQRATITCSASSSVSYMLWYQQKPGSSPKPWIYDTSNLASGFPARFSGSGSGTS
YTLTINSLEAEDAATYYCHQRSGYPYTFGGGTKVEIK

KS VH mouse, peptide threaded (Mo PT)
QIQLVQSGPELKKPGETVKISCKASGYTFTNYGMISWV^
FSLETSASTAFLQLNNLRSEDTATYFCVRFISKGDYWGQGTSVTVSS

KS VK mouse, peptide threaded (Mo PT)
QIVLTQSPATLSASPGERVTITCSASSSVSYMLWYLQKPGSSPKPWIFDTSNLASGFPSRFSGSGSGTT
YSLIISSLEAEDAATYYCHQRSGYPYTFGGGTKLEIK

KS VH mouse
QIQLVQSGPELKKPGETVKISCKASGYTFTl^GMNWVKQTPGKGLKWMGWINTYTGEPTYADDFKGRFA
FSLETSASTAFLQINNLRNEDMATYFCVRFISKGDYWGQGTSVTVSS

KS VK mouse
QILLTQSPAIMSASPGEKVTMTCSASSSVSYMLWYQQKPGSSPKPWIFDTSNLASGFPARFSGSGSGTS
YSL11SSMEAEDAATYYCHQRSGYPYTFGGGTKLEIK

MAb 14.18 - IL-2

In analogy, monoclonal antibody 14.18 was fused to IL-2 and de-immunized according to the
invention.

Potential T-cell epitopes in murine and veneered 14.18 sequences are:

Sequence            Number of potential   Location of potential epitopes
                    T-cell epitopes
Murine 14.18 VH     11                    3(17), 9(15), 30(5), 35(17), 39(15),
                                          43(9), 58(12), 62(11), 81(11), 84(16),
                                          101(7)
Veneered 14.18 VH   5                     43(9), 58(12), 62(11), 81(11), 84(16)
Murine 14.18 VK     7                     7(7), 13(11), 27(15), 49(11), 86(17),
                                          97(11), 100(4)
Veneered 14.18 VK   5                     27(15), 49(11), 86(17), 97(11), 100(17)



Amino acid changes and potential epitopes in de-immunized 14.18 VH are:

Variant        Cumulative residue changes       Potential epitopes (no. of potential MHC
                                                binders from 18 tested)
14.18DIVH1     none                             none
14.18DIVH2     41I -> P, 45L -> T, 50L -> A     none
14.18DIVH3     65S -> G                         58(8)
14.18DIVH4     71A -> V                         58(8), 62(4)
14.18DIVH5     45T -> L, 41P -> I               43(9), 58(8), 62(4)
14.18MoPTVH    NA                               43(9), 58(12), 62(11)



Amino acid changes and potential epitopes in de-immunized 14.18 VK are:

Variant        Cumulative residue changes       Potential epitopes (no. of potential MHC
                                                binders from 18 tested)
14.18DIVK1     None                             none
14.18DIVK2     46L -> M, 49Y -> H               none
14.18DIVK3     96P -> T, 100Q -> G              97(5)
14.18DIVK4     96T -> L                         97(11)
14.18DIVK5     27e S -> R                       27(15), 97(11)
14.18DIVK6     46M -> L                         27(15), 49(11), 97(11)
14.18MoPTVK    NA                               27(15), 49(11), 97(11), 100(4)



Sequences of versions of modified epitopes are:

14.18 VH veneered:
EVQLLQSGPELKKPGASVKISCKASGSSFTGYl^
LSVDKSSSQAYMHLKSLTSEDSAVYYCVSGMEYWGQGTTVTVSS

14.18 VK veneered:
DVWTQSPGTLPVSLGERATISCRSSQSLVHRNGNTYLHWYLQKPGQSPKLLIHKVSNRFSGVPDRFSG
SGSGTDFTLTISRLEAEDLAVYFCSQSTHVPPLTFGQGTKLEIK

14.18 de-immunized VH1
EVQLLQSGPELKKPGASVXISCKASGSSFTGYNMNWWQAIGQRLEWIGLIDPYYGGTSYNQKFKSRVT
ITADKSSSQAYMHLKSLTSEDTAVTYCVSGMEYWGQGTTVTVSS

14.18 de-immunized VK1
DWMTQSPGTLPVSLGERATISCRSSQSLVHSNGNTYLHWYLQKPGQSPKLLIYKVSNRFSGVPDRFSG
SGSGTDFTLTISRLEAEDMAVYFCSQSTHVPPPTFGQGTKVEIK

14.18 de-immunized VH2
EVQLLQSGPELKKPGASVKISCKASGSSFTGYNMNWVRQAPGQRTEWIGAIDPYYGGTSYNQKFKSRVT
ITADKSSSQAYMHLKSLTSEDTAVYYCVSGMEYWGQGTTVTVSS

14.18 de-immunized VK2
DWMTQSPGTLPVSLGERATISCRSSQSLVHSNGNTYLHWYLQKPGQSPKMLIHKVSNRFSGVPDRFSG
SGSGTDFTLTISRLEAEDMAVYFCSQSTHVPPPTFGQGTKVEIK

14.18 de-immunized VH3
EVQLLQSGPELKKPGASVKISCKASGSSFTGYNMNWVRQAPGQR^
ITADKSSSQAYT4HLKSLTSEDTAVYYCVSGMEYWGQGTTVTVSS

14.18 de-immunized VK3
DVVMTQSPGTLPVSI^ERATISCRSSQSLVHSNGNTYLHWLQKPGQSPKMLIHKVSNRFSGVPDRFSG
SGSGTDFTLTISRLEAEDMAVYFCSQSTHVPPTTFGGGTKVEIK

14.18 de-immunized VH4
EVQLLQSGPELKKPGASVKISCKASGSSFTGYNM3^JWVRQAPGQRTEWIGAIDPYYGGTSYNQKFKGRW
ITVDKSSSQAYMHLKSLTSEDTAVYYCVSGMEYWGQGTTVTVSS

14.18 de-immunized VK4
DWMTQSPGTLPVSLGERATISCRSSQSLVHSNGNTYLHWYLQKPGQSPKMLIHKVSNRFSGVPDRFSG
SGSGTDFTLTISRLEAEDMAVYFCSQSTHVPPLTFGGGTKVEIK

14.18 de-immunized VH5
EVQLLQSGPELKKPGASVKISCKASGSSFTGYNMNWVRQAIGQRLEWIGAIDPYYGGTSYNQKFKGRVT
ITVDKSSSQAYMHLKSLTSEDTAVYYCVSGMEYWGQGTTVTVSS

14.18 de-immunized VK5
DVVMTQSPGTLPVSLGERATISCRSSQSLVHRNGNTYLHWYLQKPGQSPKMLIHKVSNRFSGVPDRFSG
SGSGTDFTLTISRLEAEDMAVYFCSQSTHVPPLTFGGGTKVEIK

14.18 VH mouse, peptide threaded (Mo PT)
EVQLVQSGPEVEKPSASVKISCKASGSSFTGYNMI^
LTVDKSSSTAYMHLKSLTSEDTAVYYCVSGMEYWGQGTTVTVSS

14.18 VK mouse, peptide threaded (Mo PT)
DWMTQTPGSLPVSAGDQASISCRSSQSLVHRNGNTYLHWYLQKPGQSPKLLIHKVSNRFSGVPDRFSG
SGSGTDFTLKISRVEAEDSGVYFCSQSTHVPPLTFGAGTKLELK

14.18 VH mouse
EVQLLQSGPELEKPSASVMISCKASGSSFTGYNMNWVRQNIGKSLEWIGAIDPYYGGTSYNQKFKGRAT
LTVDKSSSTAYMHLKSLTSEDSAVYYCVSGMEYWGQGTSVTVSS

14.18 VK mouse
DVVMTQTPLSLPVSLGDQASISCRSSQSLVHRNGNTYLHWYLQKPGQSPKLLIHKVSNRFSGVPDRFSG
SGSGTDFTLKISRVEAEDLGVYFCSQSTHVPPLTFGAGTKLELK

The foregoing description and the examples are intended as illustrative, and are not to be taken 
as limiting. Still other variants within the spirit and scope of this invention are possible and 
will readily present themselves to those skilled in the art. 



PATENT CLAIMS:

1. An immunogenically modified fusion protein derived from a parent fusion protein,
essentially consisting of a first protein / polypeptide and a second protein / polypeptide,
wherein the first protein is an immunoglobulin molecule or a fragment thereof and the
second protein / polypeptide is a non-immunoglobulin target polypeptide (X), each linked to
the other directly or by a linker molecule; said modified fusion protein having an amino
acid sequence different from that of said parent fusion protein and exhibiting reduced
immunogenicity by a reduced number of T-cell epitopes within its amino acid sequence
relative to the parent fusion protein when exposed to the immune system of a given
species.

2. A modified fusion protein according to claim 1, wherein said T-cell epitopes are peptide
sequences able to bind to MHC class II molecule binding groups.

3. A modified fusion protein of claim 1 or 2, wherein the target polypeptide (X) is linked by
its N-terminal to the C-terminal of the immunoglobulin moiety.

4. A modified fusion protein according to any of the claims 1-3, wherein the given species
is a human.

5. A modified fusion protein according to any of the claims 1-3, wherein the fusion
components are fused via a linker molecule L.

6. A modified fusion protein according to claim 4, wherein said linker molecule L is
non-immunogenic or less immunogenic.

7. A modified fusion protein according to any of the claims 1-6, wherein the fusion region
represented by the C-terminal region of the immunoglobulin portion and the N-terminal
region of the non-immunoglobulin target polypeptide (X) has no or a reduced number of
T-cell epitopes.

8. A modified fusion protein according to any of the claims 1-7, wherein the
immunoglobulin portion or a fragment thereof is less immunogenic.

9. A modified fusion protein according to any of the claims 1-7, wherein the target
polypeptide (X) portion is less immunogenic.



10. A modified fusion protein according to any of the claims 1-9, wherein said
immunoglobulin molecule or fragment thereof is IgG1 or IgG2.

11. A modified fusion protein according to any of the claims 1-10, wherein said
immunoglobulin fragment is an Fc portion.

12. A modified fusion protein according to claim 11, wherein said Fc portion has a reduced
affinity to Fc receptors.

13. A modified fusion protein according to claim 11 or 12 having the formula

Fc - Ln - X

wherein

Fc is the Fc portion of an immunoglobulin molecule (antibody),

X is a non-immunoglobulin target polypeptide

L is a linker peptide,

n = 0 or 1, and

wherein X and / or L comprises amino acid residue modifications which elicit a reduced
immunogenicity compared to the parent molecule.



14. A modified fusion protein according to claim 13, wherein at least X has no or a reduced
immunogenicity.

15. A modified molecule according to claim 14, wherein furthermore the fusion region
between Fc and X and optionally Fc and L and / or L and X has no or a reduced
number of T-cell epitopes.

16. A modified fusion protein according to any of the claims 1-10 having the formula

A - Ln - X

wherein

A is a whole antibody or its sFv, Fab, Fab', F(ab')2 fragments

X is a non-immunoglobulin target polypeptide

L is a linker peptide,

n = 0 or 1, and

wherein A and / or X and / or L comprise amino acid residue modifications which elicit a
reduced immunogenicity compared to the parent molecule.

17. A modified fusion protein according to claim 16, wherein at least A or X has no or a
reduced number of T-cell epitopes.

18. A modified molecule according to claim 17, wherein furthermore the fusion region
between A and X and optionally A and L and / or L and X has no or a reduced
immunogenicity.

19. A modified fusion protein according to any of the claims 16 - 18, wherein A is selected
from the group:

anti-EGF receptor (HER1) antibodies
anti-HER2 antibodies
anti-CDx antibodies
anti-cytokine receptor antibodies
anti-17-1A antibodies,
anti-KSA antibodies
anti-GP IIb/IIIa antibodies
anti-integrin receptor antibodies
anti-VEGF receptor antibodies.

20. A modified fusion protein according to claim 19, wherein the antibody is selected from
the group:

monoclonal antibody 225 and derivatives,
monoclonal antibody 425 and derivatives
monoclonal antibody KS 1/4 and derivatives
monoclonal antibody 14.18 and derivatives
monoclonal antibody 4D5 / HER2 (Herceptin®) and derivatives
monoclonal antibody 17-1A and derivatives
monoclonal antibody 7E3 and derivatives
monoclonal antibodies LM609, P1F6 and 14D9.F8 and derivatives
monoclonal antibody DC-101 and derivatives
monoclonal anti-IL-2R antibody (Zenapax®) and derivatives

21. A modified fusion protein according to any of the claims 1 - 20, wherein the target
polypeptide X is selected from the group:

cytokines, integrin inhibitors, soluble cytokine receptors, glycoproteins, hormones,
glycoprotein hormones, leptin, growth hormones, growth factors, anti-hemophilic factors,
antigens, cytokine receptor antagonists.

22. A modified fusion protein according to claim 21, wherein the target polypeptide X is
selected from the group:

IL-2, G-CSF, GM-CSF, EPO, TPO, TNFa, soluble TNF receptor, IL-12, IL-8, FGF,
TGF, EGF, VEGF, PMSA, IGF, insulin, hGH, RGD-peptides, endostatin, angiostatin,
BDNF, CNTF, protein C, factor IX,
and biologically active fragments thereof.

23. A modified fusion protein according to any of the claims 1 - 22, selected from the group:
MAb KS 1/4 - IL2, MAb 14.18 - IL2
MAb 425 - IL2, MAb c425 - IL2, MAb h425 - IL2, MAb 425 - TNFa
MAb 225 - IL2, MAb c225 - IL2
MAb 4D5 - IL2, MAb DC101 - IL2, MAb LM609 - IL2,
Fc - IL2, Fc - TNFa, Fc - G-CSF, Fc - EPO, Fc - Leptin, Fc - KGF,
Fc - BDNF, Fc - CNTF, Fc - B-Cerebrosidase, Fc - TPO, Fc - GM-CSF,

24. A DNA sequence encoding a fusion protein of any of the claims 1-23 and 36.



WO 02/066514    PCT/EP02/01690

25. A DNA sequence of claim 24, comprising 

(i) a signal sequence 

(ii) a DNA sequence encoding all domains or a Fc, sFV, Fab, Fab' or F(ab')2 domain of an 
IgGl, IgG2 or IgG3 antibody, and 

5 (ii) a DNA sequence encoding the polypeptide (X), and optionally 

(iii) a DNA sequence encoding the linker molecule. 

26. An expression vector comprising a DNA sequence of claim 24 or 25.

27. A pharmaceutical composition comprising a fusion protein according to any of the claims 1
- 23 and 36, optionally together with a suitable carrier, excipient or diluent.

28. A method for preparing an immunogenically modified fusion protein as specified in any of the
claims 1 - 23, comprising the steps:

(i) determining the amino acid sequence of the parent fusion protein or part thereof;

(ii) identifying one or more potential T-cell epitopes within the amino acid sequence of
the fusion protein by any method including determination of the binding of the peptides to
MHC molecules using in vitro or in silico techniques or biological assays,

(iii) designing new sequence variants by alteration of at least one amino acid residue within the
originally identified T-cell epitope sequences, said variants being modified in such a way as to
substantially reduce or eliminate the activity or number of the T-cell epitope sequences
and / or the number of MHC allotypes able to bind peptides derived from said biological
molecule as determined by the binding of the peptides to MHC molecules using in vitro or
in silico techniques or biological assays or by binding of peptide-MHC complexes to T-cells,

(iv) constructing such sequence variants by recombinant DNA techniques and
testing said variants in order to identify one or more variants with desirable properties,
and

(v) optionally repeating steps (ii) - (iv),

characterized in that the identification of T-cell epitope sequences according to step (ii) is 
achieved by 

(a) selecting a region of the peptide having a known amino acid residue sequence;

(b) sequentially sampling overlapping amino acid residue segments of predetermined
uniform size and constituted by at least three amino acid residues from the selected
region;

(c) calculating an MHC Class II molecule binding score for each said sampled
segment by summing assigned values for each hydrophobic amino acid residue side chain
present in said sampled amino acid residue segment; and

(d) identifying at least one of
said segments suitable for modification, based on the calculated MHC Class II molecule
binding score for that segment, to change the overall MHC Class II binding score for the
peptide without substantially reducing the therapeutic utility of the peptide.
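
For illustration only, steps (a) to (d) of claim 28 can be sketched in Python as follows. This is a minimal sketch under stated assumptions and not the claimed implementation: the per-residue values in `HYDROPHOBIC_VALUES`, the default segment size, the sampling step and the selection threshold are all hypothetical, since the claim does not fix them, and the function names are invented for this example.

```python
# Hypothetical assigned values for hydrophobic side chains (assumption:
# the claim does not state which residues receive which values).
HYDROPHOBIC_VALUES = {"F": 2, "W": 2, "Y": 2, "I": 1, "L": 1, "M": 1, "V": 1}


def sample_segments(region, size=13, step=1):
    """Step (b): sample overlapping segments of uniform size from the region.

    Returns (start_index, segment) pairs; start indices are 0-based.
    """
    if size < 3:
        raise ValueError("segments must contain at least three residues")
    return [(i, region[i:i + size])
            for i in range(0, len(region) - size + 1, step)]


def binding_score(segment, values=HYDROPHOBIC_VALUES):
    """Step (c): sum the assigned values of the hydrophobic residues present."""
    return sum(values.get(residue, 0) for residue in segment)


def candidate_segments(region, threshold, size=13, step=1):
    """Step (d): keep segments whose score reaches a hypothetical threshold."""
    scored = [(start, seg, binding_score(seg))
              for start, seg in sample_segments(region, size, step)]
    return [item for item in scored if item[2] >= threshold]
```

Calling `candidate_segments(region, threshold)` on a selected region returns the start position, sequence and score of each segment that would be considered for modification. Whether a modification preserves therapeutic utility is outside the scope of this sketch.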

29. The method according to claim 28, wherein step (c) is carried out by using a Böhm
scoring function modified to include a 12-6 van der Waals ligand-protein energy repulsive
term and a ligand conformational energy term by (1) providing a first data base of MHC
Class II molecule models; (2) providing a second data base of allowed peptide backbones
for said MHC Class II molecule models; (3) selecting a model from said first data base;
(4) selecting an allowed peptide backbone from said second data base; (5) identifying
amino acid residue side chains present in each sampled segment; (6) determining the
binding affinity value for all side chains present in each sampled segment; and optionally
(7) repeating steps (1) through (5) for each said model and each said backbone.
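
For illustration only, the iteration over models and backbones in claim 29 can be sketched in Python as follows. This is a sketch under stated assumptions: `side_chain_affinity` is a hypothetical caller-supplied function standing in for the modified Böhm scoring function, whose actual terms and parameters are not reproduced here, and `lennard_jones_12_6` is the general textbook 12-6 form, not necessarily the parameterisation or combination used in the claimed method.

```python
from itertools import product


def lennard_jones_12_6(r, epsilon, sigma):
    """Textbook 12-6 potential: a repulsive r^-12 and an attractive r^-6 part.

    Shown only to make the "12-6" term concrete (assumption: generic form).
    """
    ratio6 = (sigma / r) ** 6
    return 4.0 * epsilon * (ratio6 * ratio6 - ratio6)


def score_segment_over_models(segment, models, backbones, side_chain_affinity):
    """Score one sampled segment against every model/backbone pair.

    side_chain_affinity(model, backbone, position, residue) is a hypothetical
    callable returning the binding affinity contribution of one side chain.
    """
    results = {}
    for model, backbone in product(models, backbones):   # steps (3), (4), (7)
        total = 0.0
        for position, residue in enumerate(segment):      # step (5)
            total += side_chain_affinity(model, backbone, position, residue)  # step (6)
        results[(model, backbone)] = total
    return results
```

Passing the scoring function in as a parameter keeps the loop structure of steps (1) to (7) separate from the choice of energy terms, which this sketch deliberately leaves open.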

30. The method of claim 28 or 29, wherein the sampled amino acid residue segment is 
constituted by 13 amino acid residues. 

31. The method of any of the claims 28 - 30, wherein consecutive sampled amino acid
residue segments overlap by one to five amino acid residues. 

32. The method of any of the claims 28 - 31, wherein 1 - 9 amino acid residues in any of the
originally present T-cell epitope sequences are altered. 


33. The method according to claim 32, wherein one amino acid residue in any of the 
originally present T-cell epitope sequences is altered. 

34. The method of claim 32 or 33, wherein the alteration of the amino acid residues is
substitution, deletion or addition of originally present amino acid residue(s) by other
amino acid residue(s) at specific position(s).

35. The method of claim 34, wherein additionally further alteration by substitution, deletion
or addition is conducted to restore biological activity of said biological molecule.
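
For illustration only, the alterations named in claims 32 to 34 (substitution, deletion or addition of one to nine residues at specific positions) can be sketched in Python as follows. This is a sketch under stated assumptions: the function name `alter`, the tuple format and the 0-based positions are hypothetical conventions chosen for this example, and the sketch does not assess biological activity.

```python
def alter(sequence, alterations):
    """Apply 1 to 9 alterations to an epitope sequence and return the variant.

    alterations: list of (kind, position, residue) tuples, where kind is
    'sub', 'del' or 'add' and position is a 0-based index into the ORIGINAL
    sequence (hypothetical convention). residue is ignored for 'del'.
    """
    if not 1 <= len(alterations) <= 9:
        raise ValueError("between 1 and 9 residues may be altered")
    residues = list(sequence)
    # Apply from the highest position downwards so that deletions and
    # additions do not shift the positions of alterations still to come.
    for kind, position, residue in sorted(
            alterations, key=lambda a: a[1], reverse=True):
        if kind == "sub":
            residues[position] = residue
        elif kind == "del":
            del residues[position]
        elif kind == "add":
            residues.insert(position, residue)
        else:
            raise ValueError("unknown alteration kind: %r" % (kind,))
    return "".join(residues)
```

A variant produced this way would then be re-scored and tested as in steps (ii) to (iv) of claim 28. A further call to `alter` could represent the additional activity-restoring alteration of claim 35.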


36. An immunogenically modified artificial protein selected from the group:

(i) Y - (L) - X, wherein Y is a cytokine and X, (L) is a molecule as defined above,

(ii) P - (L) - X, wherein P is a protein with unusual glycosylation moieties and X, (L) is
a molecule as defined above,

(iii) A - (L) - X, wherein A is an immunoglobulin or a fragment thereof and X, (L) is a
molecule as defined above,

derived from a parent artificial protein having an amino acid sequence which is different
from that of said parent artificial protein and exhibits reduced immunogenicity by a
reduced number of T-cell epitopes relative to the parent fusion protein when exposed to
the immune system of a given species, wherein said T-cell epitopes are peptide sequences
able to bind to MHC class II molecule binding groups obtainable by a method as specified
in any of the claims 28 - 35.

37. An artificial protein of claim 36, wherein at least A or X or Y or P is immunogenically
modified.